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THE USE OF MANIPULATIVE MATERIALS AND STUDENT PERFORMANCE IN THE 
ENACTIVE AND ICONIC MODES. Barnett, Jeffery C. ; Eastman, Phillip M. 
Journal for Research in Mathematics Education , v9 n2 , pp94--l02, March 
1978. " 

Expanded Abstract and Analysis Prepared Especially for I.M.E. by John 
A. Dossey, Illinois State University. 

1. Purpose 

The purpose of this study was to test v/hether or not "preservice 
elementary teachers need actual physical experience with manipulative 
materials to be able to demonstrate numerical and structural properties 
at the concrete (enactive) and semiconcrete (iconic) levels of abstrac- 
tion." While the progression through these representational modes of 
enactive, iconic, and syn^bolic knowledge has often been suggested as 
an efficacious pattern for mathematics instruction, its use in the 
training of preservice elementary teachers has not been studied. 

To these ends, the following three hypotheses were stated: 

HI: In measures of ability to demonstrate ^numerical and 
structural properties of the four basic arithemtic 
operations in the enactive m.ode, there are no signif- 
icant differences between subjects who are required 
to operate in both the enactive and iconic modes and 
subjects who are restricted to operating in only the 
iconic mode. 

H2: In measures of ability to dem.onstrate num.erical and 
structural properties of the four basic operations 
in the iconi.: mode, there are no significant differences 
between subjects who are required to operate in both 
the enactive and iconic m.odes and subjects who are 
restricted to operating only in the iconic m.ode. 

H3 : In measures of mathem.at ical achievem.ent , there are 
no significant differences between subjects who are 
required to operate in both the enactive and iconic 
modes and subjects who are restricted to operating 
only in the iconic m.ode. 

2 . Rat ionale 

Given that a large portion of the professional journals, textbooks, 
and activities at professional conventions are devoted to the use of 
manipulative materials in the teaching of mathematics, the researchers 
report that the ai^iount of inform.ation on the preparation of preservice 
elementary teachers to use these aids effectively is both limited and 
contrad ic tory . 



In addition, the adoption of Bruner's representational modes at both the 
elementary and preservice teacher textbook levels is not based on a sound 
foundation of research findings. The researchers indicnte that current 
practice, especially in the training of preservice elementary teachers, 
is one in which practitioners are operating on ''hunches" rather than on 
an empirically tested base. 

^ • Research Design and Procedure 

Two sections of a mathem.atics methods course for preservice elem.en- 
tary teach','rs were selected as a research population. One section was 
randomly designated as an exper im.ental group and was provided with both 
dem.onstrat ions of how to use m.anipulat ive m.aterials and a laboratory 
period during which actual "hands-on^' experiences with the m.aterials took 
place. The control group received the dem.onstrat ions of how to use the 
materials, but they received no first-hand experience v;ith the m.aterials. 
Both groups m.et with one of the researchers twice weekly for a one-hour 
lecture and then each group m.et with one of the researchers once a week 
for a two-hour laboratory session. Careful attention was paid to the 
equivalence of verbal information provided in both sess:'-ons. 

The topics of instruction in the study period were three laboratory 
units on the properties of the whole num.ber system, addition and sub- 
traction of whole num.bers, and multiplication and division of whole 
num.bers. Students in the exper im.ental group com.pleted the laboratory 
exercises first with the concrete m.aterials (enactive) prior to drawing 
a picture of the m.aterials in the laboratory m.anuals (iconic). Students 
in the control group worked the exercises without the concrete materials, 
drawing pictures of what they would do (iconic) in their laboratory 
manuals . 

The students were pre-tested with a 40-item. multiple-choice content 
test at the beginning of the exper im.ental period. The resulting score 
was used as a covariate in the analysis of the data collected. Two 
additional tests were given on the last day of the study. Test I con- 
sisted of 20 item.s sim.ilar to those covered in the laboratory m.anuals. 
Each item required the student to exhibit knowledge of a m.anipulative 
aid and a num.ber property. Students responded to these item.s by drawing 
a picture of how the m.anipulatives could be used to illustrate the 
property. (Test I had a Cronbach alpha reliability of .72 for the 
groups com.bined.) Test II consisted of 20 multiple-choice item.s on 
the mathematical content covered in the three units studied. (Test II 
had a Cronbach alpha coefficient of 0.47 for the exper im.ental group 
and 0.54 for the control group.) 

Nineteen students from each group were then random.ly selected for 
an interview on use of the m.aterials in dem.onstrat i ons of whole num.ber 
properties. They were asked to show how they would use the m.aterials 
to explain the properties to children. 



4 . Find ing s 



The three hypotheses were tested using a linear regression model 
with the pretest used as a coviriate. The only null hypothesis rejected 
was H3 , the hypothesis dealing with the effect of the treatments on the 
subjects' m.athem.at leal acheivem.ent . The results of the analysis showed 
that the students in the experimental (enactive and iconic) group per- 
formed significantly better than those in the control (iconic only) 
group (a< 0. 025) . 

5 . Inter pre tat ions 

The investigators felt that the results indicated that preservice 
elem.entary teachers do not necessarily have to interact with manipulative 
materials in order to function at either the enactive or iconic m.ode^ in 
the classroom, (noted . by the results on the Laboratory Test and the 
interview). They further stated that the Concepts Test difference is 
puzzling, but due perhaps to the low reliability. They also stated that 
the students in the exper im.ental group m.ay have actually learned the 
concepts better because of the m.aterials, but that this difference in 
learning was not reflected in their ability to function in either the 
enactive or iconic m.odes. 

While their study did not focus on attitudes, the researchers noted 
that the students in the control group seem.ed to have a decline in 
attitudes over the course of the experim.ent. 

As a summary, they suggested that the results tend to indicate that 
an instructor of a preservice elem.entary m.ethods class needs to weigh 
carefully the benefits that m.ight be derived from the use of m.anipulat- 
ives against the increase in time used and the decrease in other content 
which will be able to be covered. 



Critical Comjp.en tary 

While the stated purpose of the study is both interesting and 
worthwhile, several problem.s intervene to raise serious questions about 
the results and interpretations m.ade from. them.. Som.e of these concerns 
are : 

1. Were all the tests used pre-tested on an equivalent 

sample of studencs to get estim.ates of their reliability? 
The reliability of the Concepts Test (Test II) is so low 
that little meaning can be associated with the related 
f ind ings . 

?. Working from inform.ation provided in the paper, it 
appears that the study involved less than 10 hours 
of teaching and laboratory work. Of this tim.e, over 
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40 percent was spent in equivalent instruction in the 
classroom. This does not appear to be long enough 
a period of time to accrue the differences that were 
anticipated by the representational m.ode difference. 

3, Were the m.aterials used unfam.iliar to m.em.bers of both 
groups, or had the students in one or both groups seen 
the materials earlier in the m.ethods course or in a 
prior m.athem.atics content course? Were any checks 
made for this? 

4. Since the item.s in Test I and the Interview Test were 
so similar to those handled in class, the failure to 
obtain differences here may be due to a training effect 
coupled with low cognitive dem.ands in the items. 

The conclusions drawn in the interpretations are far too broad 
to be based on the inform.ation provided in the article. VThile conclusions 
provide one view of how the data m.ight be interpreted, one could argue 
that the differences on the Concepts Test are reflective of the students' 
internalization of the concepts. Functioning on the Concepts Test may 
call for a higher level of cognitive functioning than do the other two 
tests. This interpretation, like the one provided in the article, 
is also open to doubt due to the low reliability of Test II. 

While the study of the role of manipulativcs in the preservice 
education of prospective elem.entary teachers is an im.portant area for 
study, this investigation does little to advance our knowledge in the 
area. It serves as a potential model of what might be done with further 
work on design and ins trum.en ta t ion . 



A COMPARISOM OF THREE METHODS OF INTRODUCING TWO-DIGIT NUMERATION. Barr, 
David C. Journal for Research in Mathematics Educa t ion , v9 nl, pp33-43, 
January 1978. 

Expanded Abstract and Analysis Prepared Especially for I.M.E. by Robert 
B. Ashlock, University of Maryland. 

1 . Purpose 

The stated purpose of this study was to compare three m.ethods of 
introducing two-digit num.eration. The researcher sought to find out if 
there is any difference between exper im.ental treatments with respect to 
achievement in num.eration skills and application of num.eration principles, 
both imjnediately after instruction and four weeks after instruction. He 
also hoped to iaentify advantages of particular treatm.ents where dif- 
ferences were indicated . 



2 . Ra t lonale 

It is assum.ed in this study that children need to have a thorough 
understanding of the Hindu-Arabic num.eration system., since this knowledge 
is fundam.ental to understanding the algorithm.s of arithmetic. It is 
further assum.ed that, in order to help children acquire an understanding 
of base ten, activities which involve grouping by tens are preferred 
to those which involve a m.ore general concept of grouping. 

Although m.ost children entering school can count rationally to at 
least ten, experts disagree whether counting past ten should precede or 
follow the introduction of two-digit numerals. Therefore, research is 
needed which com.pares varied sequences for these aspects of instruction. 

3. Research Design and Procedure 

Children in eleven kindergarten classes from, four schools in three 
school districts were given a battery of four pretests. Data were 
elim.inated from, the study for children who could not count a set of ten 
and for children who could already do tasks the treatm.ents were designed 
to teach. Each rem.aining child was assigned to one of five cells and 
members of the five cells were random.ly assigned to one of three ex- 
perimental treatments: 

- Treatment A began with the counting of sets containing m.ore 
than ten m.em.bers, and writing the corresponding sym.bols by 
association. Later, instruction on the m.eaning of the sym.bols 
were given. 



~ Treatm.ent B began with grouping objects into tens and ones, 
and saying how many tens and ones. Children then wrote Vhe 



numeral a<^ the number of tens and the number of ones. 
Finally, instruction was given involving the usual num.ber 
nam.es . 

- Treatment C beg an with the counting of sets conta in ing m.or e 
than ten mem.bers, and writing the corresponding symbols by 
association. Instruction then focused on grouping objects 
in sets of ten and counting by tens. Finally, children were 
taught the m.eaning of the sym.bols they had been writing. 

Each treatm.ent group received ten 20-m.inute lessons. Posttests were 
adm.inistered to test both achievement in num.eration skills and application 
of these skills to other contexts such as the num.ber line. In all, three 
maintenance lessons were given before retention tests were adm.inistered 
four weeks after the posttests. A MANOVA was perform.ed on posttest and 
on retention test data from, the three groups, with the m.ean treatm.ent 
group score within a class as the exper im.ental unit. A d iscr im.inant 
analysis was carried out on retention test data. 

4 . Findings 

Although Treatm.ent C obtained the highest posttest score on the 
criterion m.easure and Treatment B the lowest. The MANOVA showed no 
significant difference between treatm.ents at the posttest stage. 

For the retention tests, the overall difference between treatm.ents 
was found to be significant (p < .05). The first d iscr im.inant function 
(which was found to be significant) gave the largest value for Treatment 
C. The author then stated that "Treatment C was the m.ost effective 
although not necessarily significantly so.'' 

5 • Interpretat ions 

The author notes a trend at the posttest stage which continued to 
the retention test stage, by which tim.e there was a significant overall 
difference between treatm.ents. The difference is attributed to achieve- 
ment in num.eration skills rather than application of those skills. 

The relative weakness of Treatm.ent B is interpreted as supporting 
the findings of Rathm.ell (1972), The author suggests that prior ex- 
periences of children with counting and reading num.ber nam.es m.ay m.oke it 
difficult for them, to perceive Treatm.ent B as something different from 
what they already know and can do; pre-existing counting behaviors 
continue unm.odified. 

To the author, it seem.s that counting more than ten objects is 
a well-entrenched skill as children begin school and teachers should 
build on that knowledge by introducing m.ore efficient counting procedures; 
that is, by teaching them, to count by tens as was done in Treatm.ent C. 
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Cr it teal Commentary 

This study adds to a growing body of research concerned with teach- 
ing multi-digit numerals so that children have a usable understanding 
of what such numerals represent. At the very least, the study reminds 
us of two things: we must not forget there are times whefi careful ex- 
planatory teaching is effective and we would always do well to consider 
ways we can build on what children alreacy know and can do. 

The report does not indicate the results of Treatment C were sig- 
nificantly better than the results of either of the other treatments. 
Following the Significant M-\NOVA finding, pair-wise comparisons could 
have been run using Hotelling's t2. The report does not indicate this 
was done, yet the discussion proceeds as if pair-wise comparisons had 
been made; Treatment C is regarded as the "preferable" (p. 43) method. 
Low standard deviations are reported and one wonders if the tests are 
measuring what they purport to measure. Are the tests sensitive enough 
to m.easure the differences that exist? 

The Rathmell study (1972) and the present study are of special 
interest, for the reports seem to suggest it is appropriate for child- 
ren to learn more things by rote than has been assumed* in recent years. 
Iri the present study. Treatments A'and C both involve children writing 
two-digit numerals before they have learned to relate the ten's dieit 
to sets of ten. In these treatments, children count beyond ten and 
learn the symbols "by association." Were the children thereby learning 
just by rote, or were they focusing on patterns within the sequence of 
oral number nam.es and within the sequence of two-digit num.erals? 
Ginsburg (1977) emphasizes that children find their own patterns or 
meanings in learning which is apparently rote. Counting from 1 through 
99 is not just making 99 arbitrary associations. The child observes 
patterns in the sequences of names and symbols, and imputes his or her 
own meanings to what is observed- 

The author included grouping by ten but not by other numbers 
assuming that grouping experiences with other numbers are not appropriate. 
This assumption is based in part on the fact that Rathmell (1972) did 
not find significant differences. Can such non-significant differences 
allow us to say that one procedure is "preferable" (pp. 33-4)? 

The report makes clear that, before the treatments, the chiJdren 
could all count rationally to ten. However, it is not clear in Treatments 
A and C, when children counted sets of more than ten, just how far the 
counting actually extended. Are we to assume the numbers 11 through 99 
were introduced m.uch as we would introduce the numbers 1 through 10? 
This would probably include recording a numeral each time one more is 
joined with a set already counted, with the last number name and the 
numeral telling how many are in the set altogether. For Treatments A 
and C it is unclear, when children counted sets of 10 or more, whether 
they were able to do rational counting with the larger sets. Certainly, 
children can learn to count by rote to 100. Can the writing of a sequence 
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of two-digit numerals be Learned by .i child who dc)K-\ n^a roali::-.' l'::iL 
counting any set, the la.st number nan*.e tells liow r^anv arc in ihr ..l 
If so , wou Id it ma ke a difference as we plan i n s t ru c t i . ; n .\ 1 ^. < ■ < : i ; » ■ d r - . 
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DESIGNING INSTRUCTIONAL METHODS IN M.\THEMATICS TO ACCOMMODATE DIFFERENT 

PATTERNS OF APTITUDE. Becker, Jerry P.; Youn^, Courtney D. Jr. 

Journal for Research in Mathematics Educatio n, v9 nl, pp4"19, January 1978. 

Expanded Abstract and Analysis Prepared Especially for I.M.E. by Boyd 
D Holtan, West Virginia University. 



1 ^ Purpose 

The study investigated if two instructional treatments. Guided 
Discovery and Meaningful Didactic, would have an interaction effect with 
two aptitude variables, Reading and Reasoning. 

2 . Rat ionale 

Cronbach suggested that the method of instruction or treatment m.ay 
interact with Lhe learner's individual aptitudes. Since then, researchers 
have Uv^en conducting ATI studies (Aptitude -Treatm.ent-Interac t ion) seeking 
objective evidence of the interactions. Even though m.any teachers feel 
that the interaction relationships exist, researchers have net had a 
successful record of showing them. 



3 - Researc h Des ign and Procedure 

This in^/est igat ion used a regression analysis statistical procedure 
and determined the experimental groups by using a matched-pair procedure. 

The subjects, 62 eighth-grade students, were selected randomly from 
pairs of students who had sim.ilar results on reading and reasoning pre- 
measures. From the "matched'' pairs of students who exhibited "high 
reading-high reasoning", "high reading-low reasoning", "low reading-high 
reasoning", "low reading- Low reasoning", and "middle reading-middle rea- 
soning' , subjects in each matched p ir were random.ly assigned to one of 
the two ex-:)er imental treatments. The two treatments. Guided Discovery 
and Meaningful Didactic, were concerned with devising formulas for finding 
the sum. of rhe firsc n term.s of various num.ber series and were presented 
in a progr med- ins true t ion format. The subjects worked through an 
introductc . program to learn the terminology and sym.bolism and then 
worked th-^ugh the assigned treatm.ent. 

The outcome measures were a recall of term.s, notation, and form.ulas 
(InithiL Learning), transfer requiring the derivation of form.ulas for 
new series (Formula Derivation), and derivation of the nth term formula 
for variuas series (Pattern derivation). 

Data on four other prediction variables were also collected: Tim.e 
on the Introductory Program, Num.ber of Errors on the Introductory Program, 
Time on the Treatment Program, and Mumber of Errors on the Treatment Program. 



The statistical procedures were carried out in two phases. The first 
examined the difference of the two groups using analysis of variance. 
The second phase used a procedure presented by Gujarati to examine whether 
or not the instructional treatments interacted with the aptitude variables. 
Gujarati*s technique (described in the 1970 American Statistician , v24, 
ppl8-52) was used to test if regression lines and planes were parallel. 
If not, it was assum.ed that interaction existed. 



4 . Find ings 

The first phase of the analysis indicated that the two experimental 
groups did not differ significantly in terms of perform.ance on the intro- 
ductory and treatment programs (Guided Discovery and Meaningful Didactic). 

The second phase of the analvsis sought to determ.ine, after re- 
gressing outcome scores on aptitude scores, whether the pairs of re- 
gression lines (using only one aptitude measure with two treatments) and 
the pairs of regression planes (using two aptitude measures with two 
treatm.ents) were the sam.e and, if not, whether significant interactions 
existed . 

The second analysis phase showed significant interactions when time 
on the introductory program was used to predict Pattern Derivation and 
Formula Derivation. Significant interactions were also obtained when 
reading aptitude and errors on the introductory program, were used to 
predict Initial Learning and when reading aptitude and time on the intro- 
ductory program were used to predict Form.ula. Derivation. 

In three of the four significant interactions, t im.e on the intro- 
ductory program was an important predictor. The authors pointed out 
that there was one extreme time score relative to the othejr subjects. 
They repeated the analysis with this extreme score rem.oved and the 
significant interactions were changed to not significant. 

5 . Interpret at ions 

Using reading aptitude and reasoning aptitude as predictors, sig- 
nificant 'interactions were not found. There was some indication that 
time and errors on the introductory program, may be of interest. The 
authors suggest that the treatment time may have been too short and 
that the predictor variables were not sufficiently error-free for the 
regression analysis to be appropriate. 



Cr it ical Comjnentary 



Instructional methodology research study results have been very 
mixed when comparing one strategy directly with another strategy. 



ATI research has been attempting to find which learning strategies and 
which materials m.ight be successful with which learning attribute or 
style. This study was a comm.endable effort to refine the ATI analysis 
in search of possible interactions. The statistical analysis was care- 
fully done, and the further investigation of the effect of the extrem.e 
score indicated a sincere etTort to Icok for the interactions and to re- 
port the results. 



After the first staLi?^ical procedure found no significant differences 
between the treatm.ent effects (Guided Discovery and Meaningful Didactic), 
they quickly moved on to the second analysis. Most of the analysis was 
based on the predictor variable data collected during the treatm.ents. 
This type of post-hoc analysis is an excellent strategy to search for 
hypotheses and could have led to further studies if significant inter- 
actions had been found. 

Although the statistical analysis was extensive and well-handled, 
the problem, analysis of the variables involved in the predictor measures 
(Reading and Reasoning) and treatm.ents (Guided Discovery and Meaningful 
Didactic) might have been clarified further. 

Possibly the words in the title "Designing Instructional Methods" 
might be a bit misleading since the study dealt m.ore with finding ATI 
relationships than with designing the instructional m.ethods. 



The authors suggest that the predictor variables were not sufficiently 
error-free for the statistical applications used. This is quite possible. 
It may also be that learning and instruction is so m.any-f aceted that 
collected data will be too "coarse" for the statistical procedures to be 
meaningful. 



This study has added to the body of information on 
cover ATI effects that many teachers believe to exist, 
refine research further is a continuing problem.. 



at tem.pt s to un~ 
Where and how to 



PRESCHOOL CHILDREN'S KNOl'TLEDGE OF ADDITION AND :5UBTR.\CTI0N . Brush, Lorelei 
R. Journal for Research in Mathematics Education , v9 nl, pp49--3^), 
January 1978. 



Expanded Abstract and Analysis Prepared Especially for I.M.E. by Charles » 
E. Lamb, The University of Texas at Austin. 

1 . Purpose 

The purpose of the study was to investigate young children's inter- 
pretations of physical manipulations of sets of objects. In particular, 
two questions were asked: 

(a) "How do children interpret manipulations that do 
contain changes in quantity?" 

(b) "Ar-L young children able to interpret correctly changes 
in a set through the addition or subtraction of 
elements?" 

2 . Rat ionale 

Much of the research on children's number knowledge has concentrated 
on the young child's interpretation of "sam.eness" in quantity or am.ount 
(Piaget, 1952, and follow-up studies of conservation cf num.ber). Seldom, 
has the focus been on the child's conception of "differences" in quantity. 
In many cases, children who are nonconserver s will misinterpret an equal- 
ity and indicate that it is a difference. This study was designed to 
investigate, in detail, the child's conception of "difference." 

A number of studies have been done which include a sam.ple of sim.ple 
arithmetic tasks for children (e.g., Smedslund, 1966; Wohlwill / 1969) . 
Results indicate that m.ost children are able to, identify the set with 
"more" objects after objects had been joined to the set. This does not 
mean that the child understands addition as comjnonly taught in schools. 
The terms add it ion and subtraction , as used by the author, deal with the 
manipulations of joining or taking away elem.ents of sets. This does 'not 
imply a numerical m.anipulat ion with the sets. 

The present study was designed to look at m.ore complicated m.an;Lpu- 
lations than previously dealt with in research studies. Simple tasks 
were altered by changing one or m.ore features in the task. Alterations 
were assum.ed to increase the level of difficulty for the child. 

Two studies are reported. The first uses !iine basic tasks and the 
second uses additional exercises along with the basic ones to exam.ine 
more carefully children's difficulties with problems of this nature. 



3. Research Design and Procedure 



Study 1: Twenty-six nursery school and kindergarten pupils (mean 
age 61.5 months) were used in the study. They were classified by sex 
and socioeconomic class. "To be included in the study, each child was 
required to pass two criterion tests. One test asked for a judgment of 
a sta^:ic equality, the other an inequality. " 

Each child received nine tasks as part of a larger battery of tests. 
The apparatus for the tasks consisted of two cylinders. They were 
screened from view with the exception of Lhc top 5 cn^. and were plnc'cd 
before the cliLlJ. A one-to-one c:or respondcnce of 1 /f ninrbles was form.ed by 
dropping them one by one into the cylinders. The child sensed that there 
was equality in am.ounts of num.bers. The nine tasks are indicated below: 

(1) Simple Addition one marble adde.d to cylinder on 
exper im.enter ' s lef t . 

(2) Addition Inverse - immediately following Task 1, one 
marble rem.oved from the left cylinder. 

(3) Simple Subtraction - following equivalence of 14 m.arbles, 
one taken from the left cylinder. 

(4) Subtraction Inverse - following Task 3, one m.arble added 
to the left cylinder. 

(5) Complex Addition - two marbles added to the right, one 
to the left. 

(6) Complex Subtraction - two m.arbles taken from, the right, 
one from, the left. 

(7) Com.bined Addition and Subtraction - one m.arble added to 
the right, one taken from, the left. 

(8) Addition and Inequality - a one-to-one correspondence of 
12 marbles was established; then four added to the right, 
one added to the left. 

(9) Subtraction and Inequality - sim.ilar to //8 , then one m.arble 
taken from, the right. 

In each task, the child was asked, "Do both jars have the same 
num.ber of marbles, or does one jar have m.ore m.arbles?'"' The order of 
presentation was randomized (except that #2 always followed //I and //4 
always followed //3). 

Study II: Sixty children were used in the study (20 from 4 to 5 
years of age, 20 from 5 to 6 years of age, and 20 from 6 to 7 ye'^rs of 
^ge). Again they were classified according to sex and socioeconomic class. 



Again criterion tasks were used. 



Each child received 12 items which are described below: 

(1) Simple Addition: (left-hand set L) 11 flowers + 1 
flower , (right-hand set R) 11 flowers . 

(2) Addition Inverse: (L) 11+1-1, (R) 11. 

(3) Simple Subtraction: (L) 11- 1, (R) 11. 

(4) Subtract on Inverse: (L) (11 - 1) + 1, (R) 11- 

(5) Complex Addition: (L) 11 marbles + 1 marble, 
(R) 11+3 marbles. 

(5) Complex Subtract ion : (L) 11 soldiers - 1 soldier, 
(R) 11-3 soldiers. 

(7) Omitted 

(8) Addition and Inequality: (L) 9+1 candles, (R) 
13 candles. 

(9) Subtraction and Inequality: (L) 9 buttons, (R) 
13-1 buttons. 

(10) Small Number Addition and Inequality: An inequality 
of two and six marbles was established. One m.arble 
was added to the two-marble cylinder (L) . 

(11) Small Num.ber Subtraction and Inequality: Siniilar to 
Task //lO, then one marble was rem.oved from. (R) . 

(12) Addition and Vast Inequality: An inequality of two and 
15 paper clips was established. One paper clip was 
added to the smaller group (L) . 

(13) Subtraction and Vast Inequality: Sim.ilar to //12, then 
one paper clip was removed from (R) . ^ 

Children were asked three short questions for all situations: 

(1) "Do we have the same num.ber of ?" 



(2) "'Does one of us have m.ore and 



(3) "Why is t^at?" 
Tasks were presented randomly with ceiftain restraints: 

(1) //2 followed //I; (2) //4 followed //3; (3) //10-13 were given 



to children who missed either //8 or ^^9; and (4) ^^10-13 were 
given in that order at the end of other tasks . 

Findings 
Study I: 

(1) No sex or socioeconomic differences were observed in 
per f orm.ances on tasks. A m.ajority of the children 
got all tasks correct. 

(2) Tasks 1, 3, 5, and 7 seem.ed very easy (over 90% of 
subjects correctly responded). 

(3) Com.plex Subtraction was m.ore difficult than Sim.ple 
Subtrac t ion or Complex Add it ion . 

(4) Inequality tasks were m.ore difficult than tasks 
involving only the simple operations. 

(5) Inverse tasks were somewhat more difficult than 
tasks using simple operations. 

Study II: 

(1) Again, there was no difference with regard to 
sex or socioeconomic class. 

(2) Tasks 1 and 3 were very easy. 

(3) Inverse and Inequality tasks were m.ore difficult 
than tasks involving sim.ple operations-. ' 

(4) Subtraction seem.ed *m.ore difficult than addition. 

(5) Performance increased with age. 

(6) Children seem, to be able to cope with initial 
inequalities when the initial sets are small or 
if differences are quite large. 

Interpre tat ions 
Study I: 

(1) Children do surprisingly well on tasks involving simple 
ar ithme t ic opera t ions . 

(2) The author feels that most children will make the transition 
from tasks of this sort to num.erical problems with little 
difficulty. 



(3) Most children recognize the inverse relationship between 
the adding and subtracting operations. 

(A) The author suggests use of these tasks by teachers as 
a diagnostic tool, with particular attention paid to 
the child's use or nonuse of numerical ideas. 

(5) Because of children's sophistication with these ideas, 
the author suggests that teachers use the child's 
basic knowledge to build mathematical knowledge. 



Critical-, Commentary 

(1) The research j s in an important area. It examines in a 
detailed way some very basic arithmetic ideas which 
children have. The use of one-on-one interviews is 
commendable. 

\^ 

(2) The sLatem.ent that "m.ost recogni'ze the inverse relationship 
of these two operations" appears to contradict other* results 
in the related literature. Mere discussion of this point 
would have been helpful. Perhaps the differences are due 
to. task differences. 

(3) "For example, if a child spontaneously uses numbers in 
answering these tasks, the teacher is assured that the 
child has a solid basis of knowledge on which more formal 
arithmetic structures can be built" is a statement that 
seems a bit strong. The spontaneous use of numerical 
information may indicate confusion rather than understanding. 

(4) Studies of this sorb are very important to the further 
understanding of the young child's number . conceptions . 
Unfortunately, the results are often generalized too 
far. These results, when analyzed and synthesized with 
others, should prove very fruitful. 
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LOCATION OF A POINT IN EUCLIDIAxN SPACE BY CHILDREN IN GRADES ONE THROUGH 
SIX. Carlson, G. R. J ournal of Research in Science Teaching , vl3 n4 , 
pp331-336, 1976. 

Expanded Abstract and Analysis Prepared Especially for I.M.E. by Carol A. 
Thornton, Illinois State University. 

1. Purpose 

(a) To use Piagetian-type tasks to examine the ability of a sample 
of children to locate quantitatively a point in one, two, and 
three dimensions. 

(b) To exam.ine a sm.all part of Piaget's theory of cognitive develop- 
ment . 



2. Rationale 

Piaget's theory of cognitive developm.ent form.s a theoretical base 
that may be related to science and science instruction. This study was 
designed and is interpreted according to the framework Piagef provides 
in exploring children's understanding of Euclidean space. The specific 
abilities exam.ined in the study are needed in such' scientifically and mathe- 
matically related activities as quantitative linear m.ea'surem.ent (as op- 
posed to qualitative com.parisons of lengths) a;nd precise two-d im.ensional 
graph construction and interpretation. 

3 . Research Design and Procedure 

The population for the study was drawn from, grades one through six 
of a midwest elem.entary ^ichopl. In order to aid in identifying any age 
level (s) at which subjects would show a m.arked change in ability with 
respect to any of -the tasks, the population at each grade level was 
stratified and separated from, adjacent grade levels by an age span ijf 
six months. For the sam.ple, eight males and eight fem.ales were random.ly 
selected from each grade level — a total of 96 subjects. 

The three tasks posed to subjects of the study were basically the 
standard Piagetian tasks for exact location of a point in one, in two, 
'and in whree dim.ensions (tasks 1, 2, and 3 respectively). Task 1 involved 
two strings stretched tightly between two nails. The two strings were 
th^sam.e length and parallel, but were staggered so that endpoints were 
not aligned. As a subject watched, the experim.enter m.oved the bead on 
the left end of one string 14 centim.eters to the right. The subject was 
then asked to m.ove the bead on the. left end of the other string just as 
far, using any^,of the unm.arked measuring instrum.ents which were available. 
(One was the sam.e length, one longer, and one shorter than 14 cm..) 



In Task 2 the subject was shown two cards placed in opposite corners 
of a ].arger card. The experimenter placed a dot on one of the cards and 
then asked the subject to mark his or her card in the same place. Poten- 
tial measuring instruments cx..d a pencil were provided. 

In Task 3 the subject was shown two skeletal cubes depicting identi- 
cal small rooms. The experimenter pointed out a small clay ball (simulating 
a light bulb) which hung from the ceiling of one of the "rooms". The sub- 
ject was asked to put an identical clay bal] in the same place in his or 
her room. Several rods of different lengths and other potential measuring 
instruments were provided. 

Subjects were presented all three tasks on a one-to-one basis over 
a six-week period,. Each interview required 20-25 minutes. Piaget's three 
scoring stages were used to evaluate children's responses: Stage I — child 
did not understand the task or the principles involved; Stage II — child is 
in a transitional stage, groping for the right solution; Stage III — child, 
by correct response and explanation, demonstrated a steady understanding 
of concepts involved. In addition, substages were used to assess more 
accurately children's thinking. Subjects were also scored on a pass (reacl^d 

Stage III) or fail basis. 
« 

A three-factor ANOVA was used to analyze overall task performance for 
the six grade levels. A comparison was also made, within subjects, of per- 
formances on the one-, two- and three-dimensional tasks. A chi-square 
analysis was performed to assess the relationship between sex and task 
performance. 

4 . Findings 

The analysis of variance revealed a significant difference in the 
overall task performance (one Tasks 1, 2, and 3 collectively) between 
the six grade levels. Subjects did progressively and continuously ' bet ter 
in their overall performance from grades one through six. 

On the specific task analysis, the performance of subjects was found 
to differ significantly (p < .05). One-dimensional tasks were easier than 
the two-dimensional tasks, and these in turn were easier than the three- 
dimensional tasks. No sex-linked differences were found in the chi-square 
analysis of task performance by females and by males. 

5. Interpret at ions 

(a) The results of the study indicate a difference between a 
subject's ability to locate a point in two and in three 
dimensions. This conclusion disagrees with Piaget*s data 
and theory, in which no time-lag whatever Existed in the 
achievement of successive levels of two- and three-dimen- 
sional measurement. 



(b) Contrary to Piaget's findings, in which a decisive turning p oint 
in subj ects ' development of spat ial concepts related to these 
tasks occurred (at about age nine), the data of the present 
study relative to subjects' performances on twj- and three- di- 
mensional tasks indicate a smooth and continuous increase in 
achievement . 

(c) The following remarks are based on analysis in which 75 per- 
cent or more subjects passed the task. This 75 percent cri- 
terion has previously been used by Piaget to analyze group 
performance . 

1) Activities requiring only qualitative measurement 
(defined by Piaget as the use of a measuring instru- 
ment which is longer or the same length as the dis- 
tance to be measured) would probably be meaningful 
to these subjects by about the second-grade level. 
Prior to this level these subjects have only a know- 
ledge of which of two objects is the longer and no 
measurement can be made. Activities such as strip- 
chart graphing, however, showing the relative or 
comparative growth of two plants, would be meaningful 
-to most children by first grade. (Based on analyses 
of Task 1) 

2) Science-related activities requiring students to 
make a line graph, to locate points on a map, or 
to make precise geometrical drawings may not be 
meaningful to these subjects until the fourth grade. 
(Based on Task 2 analyses) 

3) Activities requiring the subjects to visualize precise 
quantitative three-dimensional shapes or to make, for 
example, a quantitative comparison of the location of 
an airplane in flight with respect to its point of 
destination may not be meaningful until approximately 
the sixth-grade level. (Based on analyses of Task 3) 

Critical Commentary 

Replication studies, such as this one, are important in their own 
right. Both from the perspective that much of Piaget 's research has 
direct implication for teaching that involves mathematics or mathe- 
matically-based science tasks, and that Piaget worked with children 
culturally different from our own, it is appropriate to duplicate his 
studies . 

That the present study was carefully designed and executed is 
evident from several standpoints; to be noted in particular is the 
development of stratification procedures prior to sampling and the 



adherence to protocol during the interviews. The cautious, yet insight- 
ful, intcrpretat ion of the data is both educationally interesting and 
informative. 

The biggest problem in reviewing the research lay in the logic, 
clarity, and completeness of the research report. In describing the 
methods and procedures, tor example, it would appear that . the general 
design of the three tasks used in the study should appear before the 
layout of administration and scoring procedures. At tim.es the writing 
style is awkward or unclear. Several im.portant sources or item.s of in- 
f orm.at ?.cr., if included, would have strengthened the overall clarity of 
the data analysis: the ANOVA table, commentaries on tabled information 
which did appear, and a report of m.eans and standard deviations. Post 
hoc analyses in case of an overall significant F ratio might also have 
been used and reported to support differences in specific com.parisons. 
Apart from these considerations the research it<?elf and the implications 
drawn are both important and valuable. 



IDENTIFYING FRRORS IN SOLVING CERTAIN LINEAR EQUATIONS. Davis, E. J.; 
Cooney, T. J. Mathematics Association of Two-Year Colleges Journal , 
v2 n3, ppl70-178, Fall 1977. 



Expanded Abstract and Analysis Prepared Especially for I.M.E. by 
Bert Waits, The Ohio State University. 

1. Purpose 

To investigate the type and frequency of errors commonly made by 
algebra students in solving linear equations. 

2. Rationale 

Teachers need to be knowledgeable about the types of difficulties 
students encounter when solving linear equations (1) in order to teach 
the topic effectively and (2) to help the individual student overcom.e 
a particular difficulty. 

V 

W 

3. Resear c sign and Procedure 

Approximately 110 "second-year basic algebra'* and "regular algebra" 
students were ^iven a 12-question exam.ination designed by the authors to 
test a wide range of m.anipula tions involved in the solution of linear 
equations. The students used in the investigation were thought to be no 
different from, other algebra students from, the sam.e high school. The 
students' teachers stated that they were prepared for the examination. 
The students were told their "performance would be record^^d." 

The authors identified 11 categories of errors: 



lA. Misuses of the rules for adding or 
subtracting positive and negative 
numbers 

2A Misused rules for m.ultiplying 
or dividing positive and 
negative numbers 

3A* .Misuse of the additive 

property of equality — this 
category includes "trans- 
posing" errors 

AA Coefficient error — uses 

additive rather than multi- 
plicative inverse 



IB Arithm.etic errors in 

addition or subtraction 



2B Arithmetic errors in 

multiplying or dividing 

3B Misuses of the multi- 
plicative property of 
equality 

4B Coefficient error — m.is- 

uses m.ultiplicative inverse 



5A 



Does not finish — 
original equation 
contains fr act ions 



5B Does not f inish-'-or i^ii. al 
equat ion does not coiita in 
, fractions 



6 Others (the majority 

of these were miscopying 
errors and undecipherable 
solutions) . 

The authors graded and analyzed each test and recorded the typus of errors 
made. 



4. bindings 

The authors' findings, consisting 
by category, were presented in several 
the frequency of error by category for 
students. 



of a frequency analysis of errors 
ways. The last two tables indicate 
"good" students and for "poor" 



Table 1 

Number and Relative Percent of Errors in Each Category 
for' Regular and Basic Algebra Student s 
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IB 


2A 
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3A 


3B 


4A 


4B 


5 A 


5B 
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Total 
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27 
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5 
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Algebra 
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3% 


9% 
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38 Basic 


60 


10 


13 


21 


3 


18 


9 


10 


26 


1 


17 


188 


Algebra 


32% 


5% 


7% 


11% 


2% 


10% 


5% 


5% 


14% 


1% 


9% 




Students 


























Total 


i28 


18 


34 


62 


14 


45 


21 


18 


51 


6 


51 


488 


for 110 


29% 


4% 


3% 


14% 


3% 


10% 


5% 


4% 


11% 


1% 


11% 





Students 



Table 2 



lA 

Lar Alge- 
bra SL'udcnLs 

9 Basic 10 

Algebra 

S L lid fill i-; 



IB 

0 



Errors by Category for Students 
Getting 10 or 11 Equanlons Correct 



2B 
14 



3A 



3B 
2 



4A 
1 



4B 
1 



5A 
4 



5B^ 
0 



6 Totals 
6 49 

0 20 



Toe a 1 

for l'") 
Scudc'n t : 



19 



69 



21 Regu- 
lar Aliv;-- 
bra Stiidont 

18 

Algehr.i 
S t ud',*n L s 

Tot a I 
for 39 
St-jden t s 
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Table 3 

Errors by Category for Students Getting 
only 2, 3, 4, 5, 6, or 7 Equations Correct 



lA IB 



15 



I 0 



2B 
24 

14 



3A 
18 

16 



3B 4A 
8 10 



4B 
6 



10 



17 38 34 10 15 16 



5A 
17 

25 
42 



5B 
6 



6 Totals 



117 



12 



38 



149 



326 



^ ^L?£P. ^ ^- ^ t io n s 

The authors discussed the following question: 

(a) How well did the students do on the inventory? 

Very Well: Mean 8.69, Median 10, and Range 
2 to ] 



(b) To what extent v;ere errors corr^pu ta t ional in nature? 
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Considerable: At least 54% of the errors were 
category 1 or 2. 

(c) To what extent were errors attributable to an in- 
ability to apply principles for solving equations 
correctly? 

\ 

22% of the errors were in categories 3 and 4, 

(d) To what extent were students unable to complete 
the process of solving an equation? 

There were 57 cases (the percent was not stated) 
of category 5 errors; 51 of them involved equations 
with fractions (e.g., 3_ _ ^ )» while only 5 (of 

2 " 7 

12) problem.s involved fractions. 

(e) Did students who scored high m.ake different types 
of errors from students who scored low? 

Most errors (75%) of the "good" students (those with 10 
or 11 correct solutions) were computational, while 

50% of the errors of the "poor" students (those with 2 to 

7 correct solutions) were ■'^omputational. 

In conclusion, the .luthors point out that their analysis could have 
other positive benefits in addition to the diagnostic assistance given 
to the individual student. It can be used with the entire class to 
discuss the types of errors made (and ways to correct them). They note 
that perhaps such a class discussion can help the individual student avoid 
similar pitfalls. 

Critical Commentar y 

This investigation is interesting but limited in scope and subject 
to subjective interpretation. The authors adm.it that their error cate- 
gorization system, is "but one of many systems that could be developed and 
used." 

Are there better error analysis systems? 
Is their system valid? 

What constitutes a valid error analysis system.? 

Are the authors* judgments about types of errors 
appropriate? 

These are difficult questions. Indeed, there are no "correct" answers. 
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No previous research was cited. Has there been similar research con- 
ducted on the same or related topics? 

It was stated that the students were from a Georgia high school. 
What was the nature of "second-year basic algebra'^? Was it remedial? 
No effort was made to analyze the difference, if any, between the two 
groups of algebra students. Tables 2 and 3 would be enhanced considerably 
if percentages had been given (as in the first table) . 

The authors have objectively quantified student errors for one type 
of elem.entary algebra task. However, is their system practical for the 
typical classroom teacher with 120-150 students and many algebra tasks 
to teach? 

It is clearly aesirable that teachers know what kind of errors their 
students m.ake so that steps can be taken to correct the errors (both in 
teaching strategies and in directing relearning by the individual student). 
Perhaps the authors' ideas can help the classroom teacher com.e up with a 
more efficient system, (albeit more subjective) to analyze and resolve 
students' errors. 
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PRETRfUNING CHICAMO STL'DRNTS BI-:rOPJ-: ADMTNISTR.\T [ON OF A MATHEMATICS 
PREDICTOR TKST. (iinther, Jonn R. Journal for Research in Ma th efiin t ic s 
Education, v9 n2, ppll8-125, Marcli 1978! 

Expanded Abstract and Analysis Prepared Especially for I.M.E. by C. 
James Lovett, Brooklyn College of the City University of New York. 

1 . Purpose 

The purpose of this study was to determine if the reliability and 
predictive power of certain m.athem.at ic s predictor tests given to Chicano 
students could be increased by providing pretraining in test-m.arking 
strategies before administration of the tests. 

2 . Rat ionale 

Earlier studies have found that ability tests that are good pre- 
dictors of m.athematics achievem.ent for Anglo students are not generally 
good predictors for Chicano students. Bernal (1971) found that teaching 
Chicano students test-m.arking strategies im.proved their results on 
mental ability tests, but the report did not include the effects, if 
any, on the reliability or predictive power of the tests. The present 
study was designed to investigate the possibility of such effects. 

3 . Research Design and Procedure 

The subjects were 136 students from the seventh-grade population 
of a junior high school selected because it had a large Chicano pop- 
ulation. Students for whom. English was a second language were ex- 
cluded from, the study. Students were classified as Chicano if they 
had a Spanish surname. On this basis, 44 percent of the sam.ple was 
Chicano. 

On a classroom basis, the sample was random.ly divided into two 
groups. One group received pretraining in a test-m.arking strategy 
for an Ar ithm.e t ic Reason ing Test . The other group received s im.ilar 
pretraining for a Missing Words Test. In both cases, the 30-m.inute 
training session consisted of an explanation/discussion of the strategy 
and practice on sam.ple item.s. Both tests were adm.in is tered to the 
entire sam.ple. Over a five-day period of time, the students then 
studied a set of program.m.ed instruct ioiial m.aterials and were adm.in is tered 
a posttest related to the content of the m.aterials. Descriptions of 
the test-m.arking strategies, sources for all tests and m.aterials, and 
a detailed schedule for each phase of the study are included in the 
original report. 

A retention study is also reported- The above experim.ent was 
conducted in June 1975; in Septem.ber both predictor tests were adm. in is tered 
again . 
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4. Findini^s 



The sample was divided into four groups according to whether students 
were Chicano (C) or non-Chicano (NC) and whether they were pretrained for 
the Arithmetic Reasoning Test (AR) or the Missing Words Test (M^^/) . For 
each group a reliability coefficient (Cronbach's Alpha) was computed for 
each predictor test. On the basis of tabular data, it was noted that 
pretraining improved the reliability of the Arithmetic Reasoning Test for 
Chicano students, and that this effect was maintained for the second 
testing (retention study). Similar effects were not obtained for the 
nou-Chicano group on the Arithmetic Reasoning Test or for either group 
on the ^.issing Words Test. 

The predictive power of a test was defined as its correlation 
with the posttest. The results are summarized below. 



Group Ar i throe t ic Reasonin g Missing Words 

C-AR . -67 .55 

C-MW .40 .72 

NC-AR .53 .31 

NC-MW .49 ^52 



The difference between the correlation coefficients for each pre- 
training/no pretraining condition was tested for significance for 
Chicano and nor-Chicano groups- For the Arithmetic Reasoning Test the 
dAffercncG between the pretrained (C-AR) ana the not pretrained (C->fW) 
Chicano groups was significant at the .08 level. For the Missing Words 
T(;st, the difference between the prei:rained (-^^W) a.-^id not pretrained 
(-AR) groups was significant at the .15 level for the Chicano students 
and at the .14 level for the non-Chicaao students. For che Chicano 
groups, the probability that both increases in ..-predictive power took 
place simultaneously by chance is less than .012 (.03 x .15). 



5. I nterpretat ions 

The pretraining improved the reliability and predictive power 
of the Arithmetic Reasoning Test for the Chicano students, but it had 
no effect for the non-Chicano students. For the Missing Words Test, 
pretraining had no effect on reliability for either group; it had a 
positive effect, however, on the test's predictive power for both 
Chicano and non-Chicano students. The results tend to confirm those 
of Bernal (1971). 

The author calls for replications of her study using a variety 
of other mathematics predictor tests. If the replications obtain 
similar results, then it is recommended that testing of Chicano 
students be preceded by pretraining. 



Critical Commentary 



This type of research Is of Interest to m.any people In the area 
of bilingual education. This Includes school officials responsible 
for the placement of children as well as researchers. The author 
appears to have conducted a very careful study, especially In terms 
of the procedures employed. There are several points, however, which 
I feel m.ust be raised. 

The use of Spanish surnam.e as the defining criterion for *'Chlcano'* 
Implicitly assumes that this classification yields a relatively homo- 
geneous group with respect to variables relevant to school learning. 
Such an assum^ptlon Is open to challenge even for a sample drawn from 
a single school (see Ramirez and Castaneda, 1974). Furthermore, as 
written, the report suggests that Chlcano students with English as a 
second language were excluded from the study. If this was Indeed the 
case, I begin to wonder about the theoretical basis of the study. How 
were Chlcano and non-Chicano groups different other than by surname? 

There are several points where I must dlsag ree with the author s 
treatment and interpretation of the data. First, the author claim.s that 
the retention study of test reliability supports the initial conclusion 
of treatment effect for the Chlcano group. The data presented, however, 
show that over the summer m.onths, with no further treatm.ents, the re- 
liabilities changed by an amount larger than the original difference 
between the two groups (from .57 to .72 for one group and from .26 to 
.08 for the other). Such changes suggest the possibility of factors 
other than the maintenance of a treatment effect. 

In another m.atter, I may be old-fashioned, but I cannot agree 
with the author's style of reporting significant differences. If 
authors choose not to use the traditional .05 level for decisions re- 
garding significance, then I think they should tell us at just what 
point they no longer consider a difference to be significant. I m.ust 
also take issue with claims based on m.ultiplying the probabilities 
found for various differences. This approach yields differences 
tested, and I do not see that any useful inform.ation is gained. 

Finally, there is an aspect of the data not m.entioned by the 
author, but which I think should be noted. The study is based on 
the supposition that tests that are good for Anglo students are not so 
for Chlcano students. Yet, in the table provided we see that the 
predictive power of the Missing Words Test is greater for the not 
pretralned Chlcano group (.55) than for the not pretrained non-Chicano 
group (.31). Even for the Arithmetic Reasoning Test the difference 
between the not pretralned Chlcano (.40) and non-Chicano (.49) groups 
Is not large. These results, which contradict the earlier studies 
cited by the author, are not discussed or even noted. 

In conclusion, I agree that replications are needed, but I would 
urge researchers to give serious consideration to the points m.entioned 
above before initiating such studies. 
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RELATION'SHTPS BFTV.TKN CLASSROOM BEHAVIOR AND COGNITIVE DEVELOPMENT 
CHARACTERISTICS VA ELEMENTARY SCHOOL CHILDREN. Good, Ron; Matthews, 
Charles; Shymansky, James; Penick, John. Journal o f Research in 
Science Teaching , vl3 n6, pp533-538, November 1976. 

Expanded Abstract and Analysis Prepared Especially for I.M.E. by 
Karen Fuson, Northwestern University. 

1 . Purpose 

This study examined the relationship between the perform.ance of 
students in grades 1 through 5 on seven conservation tasks in nine 
categories of science classroom, behavior. Two different types of 
science classroom.s were examined — student-structured and teacher- 
structured . 



2 . Rat ionale 

This study was an attem.pt to relate som.e student cognitive character- 
istics as measured by Piagetian conservation tasks to some classroom, 
learning behaviors. The basic questions of the study were: "Do conservers 
behave differently from, non-conservers in science classrooms?" and "Are 
differences constant across different classroom '=*xivironm.ents?" 

3. Research Design and Pro cedure 

All of the students in the first five grades at Florida State Univer- 
sity's Developm.ental Research School comprised the sample. Students r re 
selected for the school to represent the population at large (the nature 
of the population at large was not specified). Students were randomly 
assigned to the TSLS (Teacher Structured Learning in Science) or the SSLS 
(Student Structured Learning in Science) treatm.ents. Piagetian conser- 
vation tasks of the following types were given to each student at the 
beginning of the year and at the end of 30 weeks: num.ber, area, weight, 
displacem.ent , perim.eter, perimeter-area, and internal volum.e. Student 
behaviors on a set of nine categories were coded throughout the year 
(number of tim.es unspecified.) However, only the post-test scores and 
a coding done near the end of the 30 weeks v;ere used for this study. 
One-way analysis of variance was used to determ.ine, within each type 
of classroom, whether conservers and non-conservers differed on any of 
the ninls behavioral categories. 

A. Findings 

1. In a teacher-structured but not in a student-structured 

setting, non-conservers observed the teacher significantly 
more often than did conservers. 
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2. In a teacher-structured but not in a student-structured 
setting, conservers followed teacher's directions signif- 
icantly more often than did non-conservers . 

3. In a student-structured but not in a teacher-structured 
setting, conservers tended to make up their own activities 
with sets of manipulative materials significantly more 
often then did non-conserver s . 

4. No significant differences in either type of setting 
existed for the following categories: responds to 
teacher question, initiates interaction with teacher, 
initiates interaction with another student, receives 
ideas from another student, copies another student, 
gives ideas to another student. 

5. Ill Lt!rpre tat ions 

The authors report the following interpretations of their findings 

If a science curriculum provides each child 
with an opportunity to work with manipulative 
materials which offer a wide range of possible 
activities, then the "less cognitively advanced" 
children in a classroom are more susceptible 
to the influence of certain teacher behaviors 
than are "more cognitively advanced" children. 
Teacher's directions, evaluations, etc., seem 
to divert the non-conservers ' a ttent ion away 
from more productive activities with science 
materials to a greater degree than for conservers. 
Because of this, a TSLS strategy could work to 
the disadvantage of the "slower" students in a 
c lass room even though they may have the most to 
gain from a science class rich with manipulative 
materials. 



Critical Com>m>entary 

The objective of this study was a noteworthy one. Relatively few 
studies exist which relate student performance on Piagetian tasks to 
student school outcome measures and even fewer which relate such per- 
formance to student classroom behavior. The atta-jk on the basic pro- 
blem seems to have been a fairly comprehensive one. However, because 
classroom behavior can be considered as a variable intervening between 
cognitive abilities and achievem.ent , the design would have been strong( 
if it had also included measures of student learning in the sc ience 
class. Som.e examination of differences across grade level might also 
have been done. 
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Unfortunately, the study was not reported well. No mean scores 
were given in the entire paper, so one has no way to judge what the order 
of magnitude of the difference in behavior was. The text contains some 
irritating errors: one of the categories is omitted from Table 1 and 
*'non-conservers'* is used in the place of "conservers" in reporting one 
of the m.ajor results. Results are reported and discussed much less 
specifically than one should expect, and some important findings (such 
as differences between the cognitive tasks) are not discussed at all. 
Giving the num.bers of conservers and aon-conservers at each grade level 
would have been in f orm.at ive , A two-way analysis of variance using 
conserving ability and type of classroom, would have been a m.ore appro- 
priate analysis. This study was basically well-designed, and the 
coding of student classroom, behavior and the Piagetian interviews of 
250 children represent m.ajor investm.ents of time and energy. However, 
little can be m.ade of the results because of inadequate analysis and 
reporting. 

The im.plications the authors draw from their results deserve 
special mention (see the Interpretations section above). The results 
as reported m.erely say that in teacher-structured classroom.s, non-con- 
servers observe the teacher m.ore and conservers follow the teacher 
directions m.ore. It is not cljar whether this difference arises be- 
cause the non-conservers cannot follow the teacher ^s directions and 
must be shown the activities, whether the teacher is dem.onstrat ing for 
everyone and the conservers sim.ply do not pay attent ion/','---43r whether the 
teachers give in to the tem.ptation to do an activity for the non-con- 
servers because it is easier than helping them, do it themselves. It 
may be that the less advanced students can learn m.ore from, watching 
a well-organized and clearly explained experiment than they can by m.ud- 
dling through a 1 it t le- understood procedure them.selves. It is unclear 
whether the authors' inference that the teacher diverts non-conservers' 
attention "away from, m.ore productive activities with science m.aterials" 
stems from a value that activity in itself is productive and good or 
from some unreported data (or even intuitions from, observations) about 
the non-learning nature of the non-concerver interactions with the teach- 
er. If it is the latter, then the authors should have reported that 
additional data to substantiate their claim.. That a teacher-structured 
classroom can work to the detriment of slower students would be an 
important result (though achievem.ent data would be useful in answering 
this question). However, the data as reported do not perm.it this in- 
ference to be drawn. 

Further, it is not clear tnat conservation ability is causally 
related to the dependent m.easures at all. The cognitive characteristics 
as measured by the conservation interviews m.ay be "influential in de- 
termining" interactions, or som.e other student character ist ic (s) cor- 
related with those cognitive characteristics m.ay be the actual "cause". 
And, in fact, the pre-test conservation m.easures would logically seem, 
to be more likely causes of student behavior which "stabilized early in 
the year" than would the post-test measures. 
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One note of interest in this study not mentioned by the authors is 
that .the significant differences in classroom behaviors ot the conservers 
and non-conserver s were reflected more on the area, weight, perimeter, and 
internal volum.e tasks than on the number, displacem.ent , and perim.eter- 
area tasks. Thus, in future research, one m.ight wish to focus only on 
these tasks. These task differences m.ay, however, partially result from 
the age range studied (e.g., number conservation may have reached a 
ceiling). Again, som.e reporting of possible effects of grade level would 
have been inf orm.at ive . 

In sumjnary, one can conclude very little from this article. If the 
article had been m.ore carefully reviewed before publication, som.e of the 
major shor tcom.ings in the reporting of thi^ results might have been cor- 
rected, and the study might have contributed to o jr knowledge in this 
area . 
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THE EFFECTS OF GUIDED DISCOVERY AND INDIVIDUALIZED INSTRUCTIONAL PACKAGES 
ON INITIAL LEAR:;I:;G, TRANSFER, AND RETENTION IN SECOND-YEAR ALGEBRA. 
Hirsch, Christian R. Journal for Research in Mathematics Education , v8 
n5, PP359-368, November 1977. 

Expanded Abstract and Analysis Prepared Especially for I.M.E. by Aaron 
D. Buchanan, SWRL Educational Research and Developm.ent , Los Alam.itos, 
California . 



1 . Purpose 

To see which of three m.odes of instruction would be nost beneficial 
in promoting: (1) initial learning of elem.ents of structure in the 
system, of com.plex num.bers, (2) vertical and lateral transfer m.easured 
at the same time as initial learning, and (3) retention of initial learn- 
ing measured six months later. The three modes of instruction were; 

(a) guided discovery , where learning was guided by a teacher 
using special techniques to prom.ote discovery 

(b) expository ind ividualized ins true t ion , where students went 
through a special learning package written in linear form.at 

(c) programmed individualized instruction, where students went 
through a special learning package written in branching 
format . 



2 . Rat ionale 

Experts in mathem.atics education have advocated that instruction be 
presented in such a way that pupils can discover elem.ents of m.athem.at ical 
structure for them.selves. However, techniques for prom.oting discovery 
learning have not been used very extensively in traditional teacher- 
centered instruction. For this reason, ind rvidu^lized instructional 
packages have often been used as one way to counteract the persistent 
use of traditional teaching methods which ostensibly inhibit discovery 
learning. 

3. Research Design and Procedures 

The basic design of the study involved three treatm.ents followed by 
a series of tests (posttest, lateral transfer, vertical transfer, and 
long-term retention) to measure differences in treatm.ent effects. 
Eleven algebra II classes were assigned as entire classes to the three 
treatments. Four classes were given the guided discovery treatm.ent, 
four classes were given an expository individualized instruction package, 
and three classes were given a program.m.ed individualized instructional 
package. There is some indication that groups were assigned to treat- 
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ments based partly on whether or not they had previously been involved 
with individualized instruction packages. The investigator was concerned 
that S^s would have difficulty in adjusting to this type of instruction 
within a short time frame, and that it m.ight result in dim.inished treat- 
ment effects on student performance. 

Two standardized algebra tests were given at the beginning of the 
study so that measures of treatment effects could be adjusted for what 
turned out to be sm.all pre-treatm.ent differences between experimental 
groups. In addition, all classes had completed the same amount of their 
alge'. ra II course work prior to the study. 

The investigator and two regular classroom teachers were responsible 
for the guided . discovery treatment. The investigator would teach a 
dem.onstracion lesson to one guided discovery class each day which the two 
regular classroom teachers would observe and then repeat with their own 
guided discovery classes. Content for the guided discovery sessions was 
taken directly from, the expository individualized instruction package. 
Classes receiving one of the two different kinds of individualized in- 
struction worked through learning packages on their own. 

Each treatment dealt with a common se t of mathem.at ical concept s 
drawn from the structure of com.plex num.bers. Special care was taken 
to insure that guided discovery classes did not go beyond the scope of 
content in the individualized instruction packages. Each treatment lasted 
six days. Guided discovery classes spent six regular class periods in 
instruction; S^s receiving individualized instruction completed six one- 
day learning packages that were either expository or programmed. 

A . Findings 

Two-hundred thirteen S_s were treated as though they had been assigned 
separately to one of the three treatments. The analysis involved 61 Ss 
in guided discovery, 84 Ss who were given the expository individual ize^d 
instruction package, and 68 S_s who were given the programmed individualized 
instruction package. 

A test of initial learning (25 item.s) was constructed by the inves- 
tigator to reflect content in the three truatments. On this test, S_s in 
guided discovery scored significantly higher (p<.01) than S^s completing 
either of the individualized instruction packages. 

The vertical transfer test (11 problems worth 5 points each) included 
topics that were more advanced (e.g., functions of complex variables) than 
students' initial learning, but ostensibly required no more specific know- 
ledge than what pupils had already developed. The only significant 
differences in performance were associated with the guided discovery 
treatm.ent . 

The lateral transfer test (7 problems worth 5 points each) required 
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pupils to generalize elements of structure they had learned about con::)lex 
numbers to other mathematical systems with similar structural character- 
istics. Again Ss in guided discovery scored s i.en i f ican t ly higher th:in 
Ss completing elcher of the individualized instruction packr-es. 

On all threa tests given imjnediately after the treatments, there 
were no significant differences in performances attributable to the 
two different types of individualized instruction packages. On the 
retention test given six weeks later, there were no signiiicant dif- 
ferences attributable to any treatment. Ss in guided discovery did 
score higher than S_s who had received one of the individualized instruc- 
tion packages, but mean scores on the retention test were all very low. 

5 . Inter pre tat ions 

Students guided by a teacher to discover elements of structure in 
the system of complex numbers performed significantly better than stu- 
dents who covered the same content on their own in individualized in- 
struction packages. Not only did guided discovery result in a hieher 
level of initial learning, but this learning transferred better to other 
kinds of related learning tasks. 



Critical Comjnentary 

The results of this study have most significance in showing bettc-r 
effects from teacher-guided instruction than from self-guided packa?,es 
in the learning of new content. Il is not possible to say with much 
precision why teacher-guided instruction wai better. The' study was not 
designed to show the effects of "d iscovery" t eaching as distinct from 
teacher-guided versus self-guided instruction. The rationale gives a 
limited amount of support for the actual experiment as it was conjucLt_'d. 
The investigator tried to establish a relationship between individualized 
learning packages, discovery learning, ,:nd the concept of "active" 
learning, but this rfilat ionship was not m.ade very clear. In general, the 
rationale was tied in only the broadest way to other research^on mathe- 
matics instruction. No specific research studies were cited. 

The investigator did appear to maintain very close control over 
content presented in each of the three treatments, which is always 
difficult in this kind of c lassroom.-based study. Therefore, it is 
possible to infer som.e advantage for teacher-guided instruction. It is 
reinforcing to see results that show significant differences for difreri.'nt 
kinds of instruction, especially when so many other studies show no 
significant differences at all. Now, if we could only make better infer- 
ences about why such differences occur. . 



\ 



36 



STRIJCTURA.. i/lN'ClJlSTIC VARIABLES SELECTED INFERENCE PATTER>;S FOR 

BILINGUAL:- (;R..\DLS SIK TO TEN. Jurdak, M. E. Educational Stud ies 
in Mather tics, vS n2 , pp225-2J8, August 1977. 

Expanded A: '.ract and Analysis Prepared Especially for I.M.E. by 
Thomas J. Li./ney, University of Georgia. 



1 , Purpose 

The investigator sought to identify the relative importance of 
structural and linguistic variables in assessing students' ability to 
deal with inference patterns and to investigate whether different 
scructiirai variables were associated with the native (Arabic) or foreign 
langu.'ige (English) used for bilingual subjects. 



- . Rat ional e 

Previous research has focused on the nature of students' errors 
in using inference patterns and on the identification of structural 
and linguistic variables that account for the variance in students' 
performances. The present study represents a continuation of this line 
of research. 



3. Research Design and Procedure 

Test . The f.^st consisted of 64 items based on four sequences (two 
of propositions and two of propositional function:--' as i^ icated below: 

If Samir's car is a police car, then it is black. 

If Sam.ir's car is black, then it is com.f or table . 

« 

If Samir'p hook is an arithmetic book, then it is new. 

If Sam.ir's book Is a new book, then it contains beautiful 
pictures . 

Ail the new books contain beautiful pictures. 
All the ariLiim^eci:" books are new. 
All police cars c:re Llack. 



Al3 bJack cars are comfortable. 



Format I 



Premise 



(p)-> (q) 



Premise 



(q)-^(r) 



Premise 



(p) is true 



Quest ion 

Is (q) trueV 



or 



Is q(x) true 
for all X? 



Responses 
yes. 



m.aybe yes> 
maybe no 



Fo rm.a t 2 



'rem.ise 



Pr em.ise 



Premise 



Vx(p(x)^q(x) ) vx(q(x)-^r (x) ) p(x) is true 



Ques t ion 
Ts q(xQ) true? 
or 

Is q(x) true 
for all X? 



Responses 
yes. 



maybe yes, 
maybe no 



Variables . Four structural variables were defined: 

Decidability (D): An item was decidable if a yes or no response 
could be given rather than m.aybe yes or m.aybe no. 

Type of Lopic (L) : An item was categorized as belonging either 
to the propositional calculus or to the predicate calculus. 

Length of Sequence (S): The sequence length of an item was 
determined by whether there were one or two propositions (or 
propositional functions) involved. 

Overgenerali zation (0): An item had overgeneral iza t ion if 
the premises were propositions and the questions were about 
the corresponding propositional functions. 

There were 12 linguistic v ..-iables in English and 13 linguistic 
variables in Arabic. In general, these variables involved the number of 
occurrences of such gramjnat ical com.ponents as nouns, verbs, articles and 
pred icates . ' 

Three other variables used were: Existence of ^all* in the con- 
clusion (A); Negative occurrence of ^all' (NA) ; and Place of negation — 
that is, in the third prem.ise or conclusion (PON), 



area in Lebanon. Three of the schools were for gills, one was for boys, 

and one was co-educational. Two schools were lower middle class and three 

were upper micldle class. The subjects were bilingual with Arabic as the 
native language and English as the foreign language. 

Statistical Analysis . A stepwise multiple regression technique was 
used. "The criterion variable was proportion correct. A separate ajialysis 
was made using structural variables for each grade alone and for all grades 
combined in each of the two languages. A second analysis was made using 
both structural variables and linguistic variables in each of the two 
languages foi. each grade and all grades com.bined. The significance of a 
variable was determ.ined by the F ratio assoc iated -wi th the regression co- 
efficient at the last step of the analysis." 

4 . Find ings 

The four structural variables accounted for 43% to 60% of the variance 
for the various grade levels using the Arabic version. These variables 
accounted for 33% to 63% of the variance for the grade levels using the 
English version of the test. The variable Decidability was significant 
for each grade level. Overgeneralizat ion was significant for grades 8, 9, 
and 10 but not for grades 6 and 7, The variable Type of Logic was sig- 
nificant only for grade 8, English version. Length of sequence was not 
significant for any grade level in any language. 

The increase in variance of proportion correct accounted for by 
including linguistic variables was 21%, 20%, 16%, 17%, and 19% in Arabic 
for grades 6 through 10 respectively, and 8%, 6%, 10%, 16% and 12% for 
the sam.e grades in English. 

Decidability was the m.ost im.portant single variable for each grade, 
for all grades combined, and in each language. The significance of the 
variable Overgeneralization seem.ed to be related to the linguistic setting 
in which the variable was cast. 

Some linguistic variables were significant for both Arabic and English. 
The variable PON was significant for m.ost grades in both Arabic and English. 

5 . Interpre tat ions 

Linguistic variables in a foreign language seem to be a more im.por- 
tant predictor of item, difficulty for older children than for younger 
children in that age range 11 to 16 years. This does not appear to be 
true for the native language. Pert>aps "syntactical structures reach 
stability in a fG:*eign language later than those of a native language." 

Given that the variable D was significant for all grade levels, it 



It appears that the ability of students in grades six through ten to 
recognize valid principles is not a unitary ability which develops with age 
and levels off at a fixed age. The determinations of the validity of in- 
ference patterns is influenced by certain linguistic cues as well as the 
formal logic involved. 

Cr it leal Comjnentary 

The invest: gator conducted a well-conce ived study that focused on 
important variables in analyzing students' ability to deal with inference 
patterns. Considerably m.ore inform.ation was provided in the article than 
was possible to report in .this abstract. Apparently the investigation is 
part of a series of investigations dealing with students' understanding 
of inference patterns. 

It would have been helpful to have seen a specific test item, but 
none was given. Further, it would be interesting to know how m.any item.s 
the students were getting correct. Such inform.ation was not provided and 
hence the reader cannot determ.ine just "how well" students understand 
inference patterns. 

The linguistic variables were defined in basically quantitative term.s 
(e.g., the num.ber of predicates with possessive nouns in the item.). Sub- 
sequent investigations might consider qualitative aspects of the language. 
For exam.ple, the proposition, "If Sam.ir's car is a police car, then it 
is black," could be presented in a form, which does not utilize the "If- 
then" format, e.g., "Since Sam.ir's car is a police car, it m.ust be black," 
or "The car is black because it is Samir's car and it is a police car." 
Words such as "m.ust," "since," "because," and "provided that" could be 
used appropriately to indicate the sam.e im.plication . Many of the linguistic 
schemes for conveying im.pl icat ions in the classroom do not empp.oy the "If- 
then" form.at. 

Finally, it m.ight be interesting to follow up selected students with 
interviews to gain more in-dep':h knowledge on ways that the students dealt 
with inference patterns. 



NUMBER P/\TTERNS: DISCOVERY VERSUS RECEPTION LEARNING. O'Brien, Thomas 
C; Shapiro, Bernard J. Journal for Research in Mathematics Educ aiiion, 
v8 n2, pp83-87, March 1977. 



Expanded Abstract and Analysis Prepared Especially for I.M.E. by Gail 
Spitler, The University of British Columbia. 

1 . Purpose 

The purpose of the study was to com.pare the efficacy of two types 
of discovery learning with recep'.ion or expository learning as m.easured 
by an immediate posttest, a retention test, and a transfer test. 

2 . Rat lonale 

The study was undertaken to exam.ine the effects of discovery versus 
reception learning under carefully defined and, hence, replicable conditions 
The authors correctly point out that lack of repl icab il ity has been a fault 
of many studies related to discovery learning. The authors also relate 
the research to Piagetian theory, em.phasizing the importance of problem.s 
and problem.-solving (as opposed to exercises and drill) and the im.portance 
of social interaction in learning. 

3 . Research Design and Procedure 

Forty-five full-tim.e elem.entary or secondary school teachers enrolled 
in an introductory research m.ethodology course, a requirem.ent of the 
master's degree program in a university-based school of education, were 
randomly assigned to o|ne of three treatment groups. These groups were 
(a) discovery-T, (b) d iscovery-II , and (c) reception (interchangeably 
referred to as expository) . 

All treatm.ents involved (a) an initial learning period of 30 
minutes followed im.m.ediately by a 15-m.inute posttest, (b) a 15-m.inute 
retention test adm.inistered three weeks later, and (c) a 15-minute 
transfer test administered four weeks after the initial learning session. 
The initial learning task and the retention test involved the iden- 
tification of num.ber patterns in a standard addition grid. The trans- 
fer test involved the identification of num.ber patterns in a standard 
multiplication grid. The subjects in the discovery-I group were random.ly 
assigned to small groups of three subjects each. Following the initial 
presentation of the addition grid and a discussion of two sam.ples of num.ber 
patterns, the ins^uctions were given. 

The discove^y-r groujp was instructed to find as m.any patterns as 
they could while/ working (in groups of three. The discovery-II group 
found as many patterns as they could, working independently. Both 



All groups received a special sheet on which to record the patterns 
identified . 



Immediately following the initial learning period, all groups were 
given 15 minutes in which to list as many patterns as they remembered 
and as m.any new patterns as they could find. They were told to work 
alone without referring to their notes. 

Three weeks later, the addition grids were again distributed along 
with the sam.e instructions as described above. Four weeks following the 
initial treatment, multiplication grids were introduced and the subjects 
were instructed to find as m.any patterns as they could working independently 
for 15 minutes. 



4 . Find ings 

The dependent variable was the number of patterns identified by 

eazh subject on each test. Table 1 gives the m.ean and standard deviation 
for the posttest and the retention test. 

Table 1 

Addition Grid: Means and Standard Deviations 
of Number of Patterns Cited 



Treatm.ent 



Time Discovery-I Discovery-II Expository 

(N = 15) (N = 15) (N = 15) 



Posttiest 


9, 


.00 


6.80 


10, 


.87 




(2, 


.34)* 


(1.42) 


(2. 


.82) 


Retent ion 


12. 


.27 


6.93 


6. 


,33 




(3. 


53) 


1.65) 


(2. 


15) 


*Figures 


in parentheses 


are the 


standard deviations. 







Although no significant differences were found across trials, 
treatment and interaction effects were significant. :he Scheffe' a 
posteriori analyses of the group means showed significant posttest dif- 
ferences in favor of the expository group over either the discovery-I 
or the discovery-II group, with no significant differences between either 
of the latter. On the retention test, the discovery-I group significantly 
out-performed the discove^y-ii group and the expository group. A simple 
one-way analysis of variance and subsequent Scheffe analyses of the 
results of the transfer task indicated that the discovery-I group out- 
performed either the discovery-II or the expository group, with no sig- 
nificant differences between the latter two groups. 



their responses to the patterns actually exhibited on the videotape on 
both the posttest and the retention test. The discovery-I group suggested 
four incorrect patterns on both the aforementioned tests, while all the 
patterns offered by the discovery-II group were correct. The results of 
the transfer task; revealed three, three, and five incorrect patterns 
offered by the discovery-I, discovery-II, and expository groups, respec- 
t ively, 



5 . Interpretations 

The authors suggest that this research raises several questions. 
First, reception or expository learning may be far less effective than 
discovery learning when long-term retention is considered. Second, in- 
dependent discovery may be less effective than group discovery. Such 
findings may indicate that individualized instruction has severe limita- 
t ions. 



Critical Comjnentary 

The authors should be congratulated for completing a study of 
discovery learning that is indeed well-designed ,. with appropriate 
statistical treatment. However, they may be overly optimistic about 
the replication of such a study. If one has a copy of the videotape used 
as the expository treatment and the specific sample number examples dem- 
onstrated prior to the treatments, then clearly the entire study is capable 
of replication, if such replication would be of value. However, without 
said tapes and examples, replication is questionable. In fact, the con- 
tent of the videotape raises several interesting questions. Is it possible 
that in reality, the two discovery groups actually acquired techniques 
for looking for number patterns while the expository group was only finding 
patterns dem.onst rat ed in a random fashion? If so, then the two discovery 
groups should be expected to out-perform the expository groups, especially 
on a transfer test. When one considers the broad question of developing 
related experiments, this study leaves the reader in a quandary — what 
guidelines were used to develop the treatm.ents, especially the expository 
treatment? How would a related study be designed if it involved other 
content areas? .Again, it should be noted that the experimenters fully 
acknowledge the severe limitations of this study and correctly identify 
many of the related questions, such as, under what conditions should 
discovery learning versus expository learning be used? 

However, it is this reviewer's opinion that this study suffers 
from a m.ore important limitation: lack of a conceptual basis. Research 
related to discovery learning is in dire need of conceptual fram.ewo. ks 
that define the treatment differences so that clearly related studies 
can be com.pleted and com.pared. Without such conceptual fram.eworks or 
models, one cannot develop related experiments that examine different' 
time spans, different content areas, and alternative tr^n.^^fpr rnnHi>.-or.o 



THE CONTEXT OF IN-SERVICE EDUCATION. Osborne, Alan; Bowling, J. Michael. 

An In-Service Handbook for Mathematics Education (Alan Osborne, Ed.). Reston, 

Virginia: NCTM, ppl2-58, 1977. 

SUPERVISORS AND IN-SERVICE EDUCATION. Osborne, Alan; Bowling, J. Michael. 
An In-Service Handbook for Mathematics Education (Alan Osborne, Ed.). Reston, 
Virginia: NCTM, pp59-71, 1977. 

Expanded Abstract and Analysis Prepared Especially for I.M.E. by William 
Geeslin, University of New Hampshire. 



1 . Purpose 

These articles report on two national surveys, one of public school 
teachers and one of public school mathematics supervisors, which were in- 
tended to reflect opinions on the present state of in-service education and 
the desired state of in-service education. 



2 . Rationale 

Historically, changes in teacher background variables have been accom- 
panied by changes in in-service programs. As teachers become better trained 
and continue in the profession longer, a ciifferent type of in-service pro- 
gramming may be desirable and necessary. "The changing nature of mathematics 
teachers — their professional aspirations, background, and experience — is .he 
largest factor of change that needs to be accounted for in considering in- 
service education for the future." The supervisor may be a determining 
factor in providing a program that serves teachers' needs. 

3 . Rese arch Design and Procedure 

A representative sample of public school teachers in the United States 
which reflected actual propoi'tions of elementary and secondary ':eachers and 
of student populations by state, district, and school was selected. A 
sample of 3964 elementary and 3482 secondary mathematics terichers w^s 
obtained by selecting schools randomly. School principals were contacted 
and given instructions for randomly selecting a teacher within that school. 
Surveys were returned by 2C, 7% (j^ = 821) of the elementary teachers and 
41.1% (N = 1431) of the secondary teachers.' NCSM supervisors (N 742) 
in the United States also were surveyed and 74% (N = 549) responded. The 
teacher questionnaires consisted of 147 short-answer items and 3 optional 
open-ended questions. Twenty-one questions provided data on background 
variables, 138 questions were comjnon to both elem.entary and secondary forms, 
and 49 items were used to generate parallel "what-is** and ''what-ought --to-be" 
questions. Supervisors received a similar questionnaire. 

Descriptive statistics, correlation coefficients, and chi-square tech- 
niques were uijed. For most: statistics a r.nnf?prvp M l^^ol r»f crm" f n r^^r^^^ 



used due to the large number of significance tests performed. 



4. Findings 

Most secondary teachers (60.3%) had taken at least eight mathematics 
courses beyond calculus and only 12.4% had taken no mathematics methods 
courses. In fact, many (55.9%) held a Master's degree. Elementary teach- 
ers had training in mathematics (68.9% had taken 3 or more college mathe- 
matics content courses). Only 10.8% of the elementary teachers had taken 
no mathematics methods courses, while 35.5% had at least a Master's degree. 
Most (77.9% of the elementary teachers and 83.3% of the secondary teachers) 
had more than five years experience. Statistics concerning tenure, salary, 
school setting, average number of students per class, and grade level 
taught were given. An NCTM journal was read by 31.8% and 61.1% of the 
elementary and secondary teachers, respectively. The vast majority would 
choose teaching as a profession again. 

Responses revealed that 82.9% and 70.8% of the elementary and sec- 
ondary teachers, respectively, had participated in an in-service program 
since September 1, 1973. The most frequent sponsor of in-service programs 
appeared to be the local school system. Ti^.achers indicated a need for in- 
service work. A majority of teachers had had positive experiences with in- 
service education, but many teachers were not satisfied with past in-service 
education. Common complaints were that past programs did not fit class- 
room needs and participants were not involved in topic selection. Thirteen 
complaints were noted by at least 20% of the teachei-s. The majority (53.7%) 
of supervisors devoted .only 10% of their time to in-service education and 
only 23.6% indicated that their school spent more then $5 per teacher on 
in-service programs. A listing of selected programs was provided. 

Analyses on nearly all of the "what-is" versus "what-ought-to-be" 
questions revealed discrepancies between the present situation and the 
desired situation. In most cases, respondents desired ''more" from the in- 
service program. For example, only 32% of the elementary teachers and 28% 
of the secondary teachers noted that their school building or school system 
had an individual responsible for in-service education in mathematics but 
a corresponding 84% and 83% felt there ought to be such an individual.' 
Large discrepancies were noted in areas such as follow-up in the classroom 
after in-service programs, single-topic all-day offerings, programs designed 
especially for mathematics, workshops on motivation and applications, new 
teaching methods, use of achievement data in determining needs, and pro- 
vision of release-time from the classroom to encourage in-service partic- 
ipation. 

Secondary teachers who noted the availability of an individual re- 
sponsible for programming were more positive toward in-service programs, 
had fewer complaints, and thought in-service attendance should be required. 
Secondary teachers who have participated in program topic selection are more 
likely to express satisfaction with in- service. Sumjner programs were not 



ice programs was related significantly tb twelve variables such as satis- 
faction, positive in-service experience, background, and complaints. Sat- 
isfaction variables were related posit ively, with experience in workshops 
focusing on computation. The topic of mathetnat ical structure seems not to 
be part of recent in-service training and tea^chers with less experience are 
unlikely to have participated in such a progratr- Nine possible purpoi^es of 
in-service education were all considered highlyXdesirable, but learning 
about new curricula was the most common purpose,\ Providing release-time 
from classroom duties was the primary method (and^post desirable for teach- 
ers) by which schools encouraged participation. Teachers who felt students' 
enthusiasm for mathematics had declined were less likely to believe achieve- 
ment should be used as a criterion of effectiveness of in-service education 
programs. 



5. Interpretat ions 

It was believed that non-respondents were more likely than respondents 
to be negative toward in-service education. Respondents were well trained, 
experienced teachers who were reasonably satisfied with their profession 
and employment. In-service programs should not favor elementary over sec- 
ondary teachers even though elementary teachers appeared more satisfied 
with programs. Considerable effort must be expended to satisfy the needs 
and interests of secondary teachers. Teacher's complaints must be considered. 
Only 4 of 49 discrepancies between the actual and desired states of in-serv- 
ice education failed to reach significance^ indicating need for improvement. 
The usefulness of the content and materials is extremely important since 
it affects teachers' perceptions of power, reward, and gripes. Important 
topics are ''computation, motivation, applications, learning difficulties, 
diagnosis, evaluation, articulation, remediation, and attitudes." The com- 
ponents of enthusiasm and entertainment are necessary. In-service education 
should be directed toward specific disciplines such as mathematics teaching. 
Supervisors must use effective political action to establish and maintain 
in-service programs with "protected dollars." Supervisors must be extremely 
careful in allocating resources. 



Critical Commentary 

In-service education is clearly an important topic and the articles 
with associated appendices contain a plethora of useful information, only 
a small portion of which could be included here. The articles should be 
read by everyone involved in mathematics in-service program planning. 
Nonetheless, the naive research reader should use extreme caution in rely- 
ing on the information presented. Reporting research in popular prose 
format is a difficult task at best. A very fine line exists between using 
non-technical language and interpretation unsupported by data. Correla- 
tional and chi-square techniques t -^mpt writers to make inappropriate 
"causation" remarks. Osborne and iiowling succumbed to these hazards on 
occasion as well as overemphasized their personal happiness/unhaoDiness 



tantly that I would use similar casual statements in proposal -justi- 
fications or NCTM talks. 

In essence, only descriptive results of opinion information were pre- 
sented. A historical account of in-service education does not suffice for 
a "theoretical framework." Sam.pling and survey techniques appeared sound. 
Were state supervisors included in the sample? Problem.s with some answer 
choices were acknowledged. The use of terms such as variance in both the 
common language sense and the statistical sense should be avoided (p. 35). 
At times actual responses (p. 42) and rankings were unclear (p. 43). The 
low response rate of elementary teachers and dependence on principals in 
the sampling process are causes for concern. The authors engaged in con- 
siderable unsupported supposition about the respondents It is not clear 
that in-service program.s evolve through attem.pts "to better serve the 
teaching and learning of mathem.at ics. Reliability could have been m.ore 
thoroughly discussed. What-ought- to-be questions m.ay result in "m.other- 
hood" responses. Shuffling between two articles and three appendices 
was difficult. Finally, en em.pirical connection between teacher opinion 
and teacher behavior/student learning are necessary future steps. 



THE QUALITY OF STAN'DARDTZED HIGH SCHOOL M/\THEMATICS TESTS. Petroski, 
Joseph M. Journal for Research in Mathematics Education , v9 n2, ppl37- 
148, March 1978. 

Expanded Abstract and Analysis Prepared Especially for I.M.E. by James 
T. Fey and William J. Engelmeyer, University of Maryland, 



1. Purpose 

The objective of the study was to examine critically the content and 
technical qualities of m.athematics tests intended -for high school stu- 
dents in four areas: general mathematics, algebra, geometry, and applied 
mathematics. 

2 . Rat ionale 

Standardized mathematics tests are used extensively. Indeed, the use 
as a tool in important instructional decision-m.aking situations is increas- 
ing. However, criticism of the content validity, technical adequacy, and 
interpretability of m.any t sts is widespread, suggesting the need for care- 
ful analysis of comjnonly used tests. The present study is part of a larger 
effort based at the UCLA Center for the Stucy of Evaluation. 



3 . Research Design and Procedure 



The investigators collected over 500 specimen tests and supporting 
technical inform.ation for mathematics tests that are available and appro- 
priate for use in grades 9-12. After categorizing each test, or identi- 
.fiable sub-test, according to content and grade level of usage, each test 
was rated for quality on 39 criteria. The criteria or quality fell into 
four broad areas: validity, appropriateness for examinees, administrative 
usability, and normed technical excellence. Rating was done by a pair of 
evaluators with graduate training in education or psychology. The evalu- 
ators worked independently, basing their ratings on inspection of the 
tests and available supporting information provided by the test publishers. 

4 • Findings 

The results of test ratings are presented in a variety of sumjnary " 
tables, the basic format being percentage of tests which fit into each 
rat.infe category on each rated dimension. The overall impression from in- 
spection of these tables is that the vast majority of available tests are 
weak in many crucial aspects of test quality. Very few of che tests have 
content based on empirical procedures for item selection, measures of 
concurrent validity, or measures of predictive validity. * Most tests have 
low or unreported measures of reliability, norm groups that are not nation- 
ally representative, and low ratines on usefulness for ednn.nMnnnl Hpm'crinn- 



simplicity of raw score conversion to a coT.mon interpretable score. More- 
over, a very high percentage of the tests used reporting scales, such as 
grade equivalents or mental ages or novel scales, that have been severely 
cr it ized recently. 

5 . Interpretat ions 

The investigator's general conclusion is that current mathematics 
tests are deficient in m.any bnsic aspects of test quality, confirming the 
criticisms leveled at standardized tests from, many quarters. In addition 
to the standard criteria of validity, reliability, and interpretability , 
the available mathematics t^f^t inct:ruments seem of limited value in guid- 
ing educational decisions about individuals or groups. They are also 
seriously flawed by the limited range of outcomes m.easured, m.aking the 
problem, of curricular evaluation and/or compa^^^n extremely difficult. 
There is a suggestion to follow the NLSMA m.odel of parallel curriculum and 
test developm.ent in order to overcome the latter shortcoming in evaluation. 

Critical Commentary 

The investigator's rationale for exam.ining test quality cited three 
common criticism.s of standardized tests: (1) not covering som.e im.portant 
components of ability; (2) overtesting other components of ability; and 
(3) testing a unitary trait, rather than a com.plex set of skills. Un- 
fortunately, however, the study that was based on this problem definition 
does little to confirm, or deny the criticisms and less to suggest viable 
solutions. The rating staff consisted primarily of people unqualified to 
make th6 professional judgments necessary in assessing the extent co which 
current tests m.easure (in appropriate balance) the full range of im.portant 
school m.athematics objectives. Instead, we are offered a collection of 
technical quality ratings based on inform.ation supplied by test developers 
in their typical packages of com.m.ercial in f orm.at ion . As a result, even 
if the 39 indicators had suggested that available tests ;ire of high tech- 
nical quality, the fundamental controversy over validity would rem.ain. 

Because the inv€iS t igator reviewed all available standardized mathe- 
matics tests, the investigation was thorough. However, it is difficult 
to extrapolate the findings to well-k^^own tests. As the investigator 
indicated, accurate estimates should be compiled of the frequency of 
test purchase in order to separate m.ajor and minor ins trum.ents . 

There is a degree of ambiguity in trying to interpret the tables. 
Sinca all evaluations are recorded in tabular form, it is impossible to 
review the evaluation of any single test. A reference in the article does 
imply that information pertaining to the evaluation of individual tests 
may be available. 



INFORMATION MiVTCHING WITHIN AND BETWEEN AUDITORY AND VISUAL SENSE 
MODALITIES AND MATHEMATICS ACHIEVEMENT. Sawada, Daiyo; Janr.an, R. F. 
Journal for Research in Mathematics Education , v9 n2, ppl26-I36, March 
1^78. 

Expanded Abstract and Analysis Prepared Especially for I.M.E* by Peggy 
A. House, University of Minnesota. 

1. Purpose 

The study investigated the hypothesis that there is a predictive 
relationship between sense modality m.atching abilities and m.athematics 
achievement in fourth-grade boys. Of particular interest to the inves- 
tigators were auditory-auditory (A-A) , auditory-visual (A-V) , visual- 
auditory (V-A) , and visual-visual (V-V) matching abilities. Specif icall 
the investigation focused on four hypotheses: 

(a) There are significant correlations between mathem.atics 
achievement and m.odality matching abilities. 

(b) There are significant variations in these correlations 
across three IQ ranges. 

(c) The magnitudes of the correlations involving m.athe- 
matics and their variation over the IQ ranges are 
as great or greater than the correlations involving 
reading achievement. 

(d) The intercorrelat ions of the modality m.atching 
abilities show significant variations across the 
three IQ groups. 

2. Rat ionale 

The concern of the study was auditory and visual sensory n-odalit:;/ 
matching in which a stim.ulus • ^/as presented in one sense modality .ir..i 
the subject was asked to identify a second pattern in the same or the. 
other sense m.odality as being either alike or different. For exanpl^, 
a pattern of beeps (auditory stim.ulus) would be m.atched with a pattern 
of dots (visual). Such matching ability rs presum.ed to be important 
in school m.athematics where m.any learning experiences require th'j oo;^;! 
to perform higher-order matching. For exam.ple, the teacher says "four" 
and the pupil is to circle four objects in a picture, l-Jhile rescarr!>. 
in reading has supported the conclusion that A— V m.atching abilitv 
significantly predicts reading achievement, research reported In mat'!'' - 
matics education has not included m.odality m.atching ability eitner .is 
a cent ^1 concern in the design of treatm.ents intended to improve 



ri-.-ulin*:, tiuTC ;) r .^i i.r r i vu I a (. i onsiu p between sense riuxJaliLy matchin 

r.hilitie^; nnci ui.i L lu-mn L i c s ^:ic h i e-.'eirien t . 



Hese.irch Desig n and Procedure 

The Sf^ were 180 fourth-grade boys from 19 public schools in middle 
soe ioeronoip. u: areas of Edmonton, Alberta. Ss with "identifiable :.jmily, 
personal or disability problems" as judged by principals, teachers, or 
r-iTi:mula t ive r^trords were eliminated from the population. The Ss chosen 
: pres.-Mfed a strat, ied random sample of 60 low- (IQ 71-90), 60 middlr- 

9i-lL0) and r.O high- (TQ 111-130) ability pupils as measured bv the 
vernal [(.) score of the Lorge-Thorndike Intelligence Test. 

Four tes::.- (A-A, A-V, V-A, V-V) were administered to the ^s . Each 
test coni^.isted of 35 item.s of sam.e-dif f erent m.atching, the first five 
items bein.^ practice items only. The reported KR-20 reliabilities of 
t[-iese tests were froi? .60 to .84, a range judged by the invesui. ?*:ors 
as "not except ionallv high, but adequate." The auditory patterns employed 
1000 cycle tones wlL variations caused by patterns of short or long 
pauses. Visual patterns consisted of displays of variously spaced dots. 
Tests were adroin i s t ered to Ss in groups of four to six. In order to 
avoid transfer effects from taking the four tests in succession, the 
investigators used a 4 x 4 Latin square design to vary the sequence in 
which the tests were administered. 

Four achievement m.easures were selected for testing the hypothesized 
correlations. Three m.easures of verbal achievement were subtests of 
the Stanford Achievm.ent Test, Form W (1965): word meaning, paragraph 
iT^eaning and word study skills. Mathematics achievement was measured by 
a test developed 'v.- the Edmonton Public School System, to assess "under- 
standing of basic \-ithm.etic operations including num.ber lines, relation- 
ships, numeration, j^eometric figures." The m.athematics test has a 
KR-20 reliabilitv ^-".-timate of 0.87. Data analysis consisted of corre- 
lations between the fou^ achievem.ent m.easures and the four m.odality 
matchini-^, tcs*s for tiach IQ group. 



Find im:\ 



For the low IQ group, the investigators reported a significant 
correlation (r ^- 0.56, p<.01) between m.at hem.a t ic s achievem.ent and 
A A matching only. For the high IQ group, the correlations between 
m.athematics achievement and each of the four m.odality m.atching tests 
were significant (r = 0.39, p<,01 to r = 0.30, p<.05). Mathem.atics 
achievement did not correlate significantly with any of the modality 
matching tests for the middle IQ range. These results were considered 
)2 ive sup[)ort to the first two hypotheses. 

Tlie only si^^nif icant correlation •« with reading achievem.ent occurred 
for the low IQ groups: hctwc'^.n word .ueaning and A-V matching (r = .29, 
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p<,05) line. ...'tween word study skills and V-A matching (r = .25, p<.05). 
That the ma^^nitudos ^ • tlieso correlaticns are less than the significant 
correlation., involving mathematics was cited as evidence in support of 
Hypothesis C. 

Finally, the invesci^ator s reported in t ercorrela t i ons amon^ A-A, A-V, 
V-A and V-V performance for each IQ level. The correlations ut A-A 
perforrjance with the other matching abilities ranged from .18 to .49 and 
generally increased with increasing IQ. All other correlations remained 
relatively constant over IQ groups. Hypothesis D }:^ained some support 
from this changing relationship of A-A to the other modality m.atching 
abilities. 



^ ' Interpre tat ions 

The relationships of modality m.atching abilities and m.a t hem.a t ic s 
achievem.ent appear to depend upon the type of m.atching and on the IQ of 
the pupil. As IQ increases, A-A matching appears to become more highly 
correlated with other m.odality m.atching abilities. This m.ay enable 
higher IQ pupils to com.pensate for inability in some A-A tasks by con- 
verting to other m.odality com.binations. This further suggests that in- 
struction which depends principally on listening m.ust be used carefully 
in teaching 1- IQ pupils, in particular when instructions given in 
the auditory m.ode .:111st be related to learning tasks requiring the use 
of A-V, V-A, or V-V processing of information. The changing nature of 
the interrelationships am.ong m.odality m.atching abilities with changing 
IQ levels also suggests that modality m.acching ability may be a useful 
variable to include in ATI exper im.en t s . 



Critical Comjnentary 

The study is based on an interesting paradigm and it is significant 
in raising important questions about the pupil's ability to translate 
within and between sensory m.odalitie . The prelim.inary findings rep- 
resented by the present study suggest a potentially fruitful area of 
research. In particular, results of modality matching studies can 
contribute to an understanding of the learning processes in a m.ulti- 
sensory learning environment such as the mathematics laboratory. 

The pattern of significant correlations between m.a them.a t ics 
achievement and m.odality m.atching ability for high IQ pupils raises the 
question of a possible relationship with the Piagetian developmental 
level of these pupils. It is possible that the higher-ability fourth 
graders have m.ore fully attained a h' her operational level that 
facilitates their modality matching performances while lower-ability 
pupils m.ay still be to some extent preoperational and less able to 
abstract m.atching patterns. , " ~ '\ 

The sample of f our th-^raci^e 'boy^ is, of course, very limited, and 
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extension of similar invest li^tnt ions to ntlier nr:es and both sexes would be 
useful. In considering future studies, several questions might be consid- 
ered : 



1. Thn IQ m-.^isure used to stratify Ss is verbal IQ, yet the 
achievement measure of primary interest is mathem.at ics . 
Would the results be similar if Ss were stiratified on a 
measure of mathematical ability such as quantitative 
reasoning or spatial perception? 

2. Potential subjects with "identifiable family, personal 

or disability problems" were eliminated, but the study re- 
ports neitlier the operational deCinitions of these problems 
nor the criteria by v/hich the eliminations were determined. 
Neither does it "jstabllsh the fact that the sample is not 
biased as a resul t of such exclus ions . 

3. not clear what is measured by the mathematics achieve- 
li;.- test selected for the study. Perhaps several m.athe- 
matics m.easures are needed to differentiate am.ong com.puta- 
tional ability, quantitative reasoning, or sim.ilar variables 
This might lead to a clearer delineation of possible re- 
lationships between aspects of m.athem.at ics achievem.ent and 
modality m.atching abilities. 

It is difficult i;o interpret the investigators' com.parisons 
of the magnitudes of significant correlations involving 
mathematics and reading. True, the m.agnitudes of the five 
signifi -ant m.athem.at ics correlations exceed, at least 
slightly, t'ne -^aunitudes of the two significant reading 
c . rrelcic ions , but the com.parisons were m.ade across IQ levels 
and types of sensory modality tests. It might be more mean- 
ingful, for n given 10 level, to focus on pairwise compar- 
isons of the correlation of m.athem.at ics achievement with a 
given modality matching type (for example, A-A) and the 
correlation of reading achievement with the same modality 
macching type. 



There .-Iso are questions arising from the investigators' conclusion 
that A-A matching ablLiLV is more highly correlated with other modality 
matching abilities as Lij increases. This is true for the cor re la t ions be- 
tween A-A and A-V. liowever, except for this one case, all intercorrela- 
tions among modality matching abilities are reported to be highest within 
the middle IQ range where none of the correlations between achievement and 
modalit> m.atching ability reached s igm :" Icance . Thi-: weakens the suggested 
hypothesis that high IQ pupils compensat.j tor A-A difficulties by switch- 
ing t o other sense m.odalities. 

The investigators have indicated several 
which they Intend to pursue. The i-i e i im inarv 
sent study warrant the encouragement of thos,^ 



1 ines of future research 
results reported in the pre- 
ef forts. 
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IN()UrRY STi^ATKCY , CO^NrTTVF STYLE, AND MATHEMATICAL ACHIEVEMENT. Scott, 
Norval C. Journal fo r Kesr Mr ch in Ma t t;e.ma t ics Educatio n, v8 n2, ppI32- 
143, March 1911 . ~ ' ~^ 

Expanded Abstract and Analysis Prepared Especially for I.M.E. by Joe 
Dan Austin, Emory University. 

1 . Purpose 

This study investigated whether teaching science using an inquiry 
method had any longitudinal effects on students' mathem.at ical achievem.ont 

2 . Rat iona le 

Suchman and others used an inquiry strategy iu science teaching. 
Research has Indicated that the analytical aspects of students^ cognitive 
style was significantly im.proved by this training. Further, this im.prove 
m.ent seem.ed to persist over tim.e. The "perceptual'* approach to cog- 
nitive style has an analytical com.ponent. Research has indicated a 
strong relationship between this analytical com.ponent and m.athem.at ical 
achievem.ent . Therefore iu is reasonable to expect that students exposed 
to inquiry becom.e m.ore analytical. Thus they should do better in m.athe- 
m.atics than students not exposed to inquiry teaching. 

3 • Research Design and Procedu re 

Two groups of high school students who had com.pleted high school 
geom.etry or algebra or both were selected for this study. The Inquiry 
Group (N = 39) had at least one year of sixth- or seventh-grade science 
taught using an inquiry m.ethod. The Com.parison (iroap (N' - 40) had had 
a sim.ilar course but taught in a traditional manner. Scores on the 
California Test of Mental Maturity (CTMM) indicated the two groups were 
initially com.parable on ability level. 

The inqu iry-or ien t"^d science iss used class investigations of 
"discrepant events". Students could ask the teacher only questions 
answerable by a "yes" or "no" response. Ones- Loning continued until 
the students felt that they could explain the experim.ental situation 
and results. Som.e of the conceptual ideas were considered comjnon to 
geometry or algebra as an explanation or proof was required to explain 
VKi^r the results occurred as they did. 

All evaluation was done five years after the sixth- or seventh- 
grade science course. The two gL 'Ups were com.pared on analytical 
"conceptual" style. The Sigel Cognitive Style Test (SCST) was used to 
identify each student's analytical "conceptual" style. The two groups 
were com.pared on both algebra and geometry course grades. The corre- 
lations b»'t'wr»'n analytical "c*onceptua I" style and course grack^s w.Te 
a 1 so i n ves L i g.i ted . 
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4 . Find ings 



A chi-square analysis indicated that analytical responses were sig- 
nificantly (p < .01) highe^' for the Inquiry Group. The Inquiry Group 
had significantly higher (p < .05) course grades in both geometry and 
algebra as indicated by t-tests. None of the correlations between ana- 
lytic responses and mathematics grades was significant (a = .20). 

Students were asked what they recalled about their sixth-grade 
science class. Seventy-eight percent of the Inquiry Group students 
recalled problem-solving activities, while only five percent of the 
Comparison Group students did. A questionnaire on the effects of 
inquiry was sent to 26 Inquiry Group students. The responses (N = 17) 
indicated stronjzer support for the effect of inquiry teaching in geom.etry 
than in algebra. 

. Interpretations 

The im.plicat ions were that this study 

(a) "... has raised som.e questions about the validity of 

the term, 'analyt ical ' when referring to a styl ist ic 
disposition. " and 

(b) "...indicates that senior high school mathem.atics 
competency was enhanced by this specific inquiry program.." 
(p. 143) 

The author also sugf^sts the need to investigate "field independency" 
and senior high m.athematics grades of students exposed to this inquiry 
program.. 

Critical Com.mentary 

This study investigates an interesting question on the relation- 
ship between a science teaching m.ethod and m.athem.at ical achievem.ent . 
The paper is clearly written both as to the procedures and analysis used. 
While the num.ber of subjects is sm.all, the author acknowledges this to 
be a pilot study and seem.s careful about generalizing from the results. 

This reviewer had several questions about the paper. The first 
concerned the initial equivalence of the two groups. Was the CTMM test 
in 1966 adm.inistered before the inquiry-oriented sc ience course? While 
this seem.s likely it is not stated in the paper. Also, it would have 
been desirable to have a m.easure of students' initial m.athem.at ical 
abilities to be m.ore comfortable with the initial com.parability of the 
groups. Another question concerns the entire section on students' re- 
call about their sixth-grade science class. Why wert-: only 26 of the 
39 Inquiry students given the questionnaire? Whv — with so few 
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studerT-s — were only 17 responses received? This plus the absence of 
any statistical tests r.ake these results somewhat suspect. 

In spite of these questions the main results seem, valid. If the 
results can be replicated, it would seem logical to try to adapt the 
teaching m.ethod to m.athem.at ics instruction. Would this have a stronger 
influence on m.athematical achievem.ent than the transfer from, science 
instruc t ion? 



THE UNDER .VANOING OF SIMILARITY AND SHAPE IN CLASSIFYING TASKS. Vollrach, 
Hans-Joachim. Educati onal Studies in Mathematics , v8 n2 , pp211-224, 
August 1977 

Expanded Abstract and Analysis Prepared Efpecially for I.M.E. by Edward 
M. Carroll, New York University. 



1 . Purpose 

Assuming i:hat similarity can be tested with classifying tasks, the 
purpose of the ::tudy was to provide evidence for answering the following 
questions: 

(a) What are S^s ' understandings of the expression "similar" 
as used in colloquial speech to apply to geom.etrical 
figures? 

(b) Does the "geometrical context" of the basic set of 
figures influence the results of the sorting process? 

(c) Is there a relationship between age and the results of 
the sorting problems? 

(d) Is there a relationship between sex and the type of 
sorting solutions? 

(e) How do the terms "similar" and "same shape" in the 
directions influence the results of the sorting? 

(f) What is the relat ion ship between t he arrangemen t of 
sorting tasks and the results? 



2 . Rat ionale 

The names of many geometrical concepts are used not only in geometry 
but also in colloquial speech. But as a result of a process of m.athe- 
matization in geometry, they are restricted by axioms and definitions. 
The names of some geometrical concepts are connected with images which 
do not belong to the geometrical concept (e.g., similarity, plane, curve, 
angle, etc.). Therefore when these names are used in geometry, underst.ind- 
ing of the concepts may be hindered because properties may be' retained 
^hich are not included Ln the p.oometrical concept and wiiich m,ay therefore 
lead to false inferences. 

Previous research investigations on sLuiilarity (Piaget and Inhelder, 
L948), on verbal context in developing understanding the meaning of un- 
cnown words (Kaplan, 1950), on judgments of attributes (Postm.an and Page, 
L947), and on f ree-obj ec t -sort ing tests (Gardner, 1953) were used by the 
luthor to develop the rationale for the study. 
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3. 



Research Design and Procedure 



Th'-ec' oxprx'iments (N = 110, N = 50, N = 30) were conducted using 
studencs ranp.ing ■''a ages from 8 years to 19 years, randomly selected 
from i.iftr^e.r z ^'''h • 2S in different schools in the northern part of Bavaria, 
FedercJ Rcp'.i....i' of ';..r7T?any, dur ine the autumns of 1975 and 1976, Six 
tests vere c^-^-- -iC using different sets of paper geom.etric figures; 
five ot the Fr'Ci . •.r'^ cuL from white paper and one set was cut from, checked 
(graph) ■. .-^t One included eight plane figures, four round, sim.ple 

closed c^rv*-*r -^.ci ':our m.any-sided polygons. Test Two included 15 plane 
figures, tlKe'^ /.tv/n of circles, squares, triangles, rectangles, and tra- 
pezia. Test -e ^. onsisted of 11 plane figures: five squares and six 
reciiangies, Tc>-'. l'(v-z contained nine plp.ne figures (rectangles). Test 
Five and Te.^t ^> i i . --ieu six rectangular plane figuref:^ with side-lenguh 
ratios of 1:2, — ^^nd 1:A, 

In geom^efy sl:nilarity is a relation between figures ^point set). A 
Figure is sinAlar to a Figure F^ there exists a sim.ilarity transfor- 
mation _S such tl\at ~ Z.2* follows that one rectangle is sim.ilar 

to another if and only if the ratios of the sides are equal. The sequence 
of sorting tasks was designed t.o guide the S^s to discover side-length ratios 
of rectangles. 

In each test all of the figures were placed on a table unordered. 
In Experim.ent I, for each of the six tests, the 110 _Ss were directed to 
"Put all sim.ilar figures in a hesp." In Experim.ent II, for each of the 
six tests, the 50 Ss were direcced to "Put all figures of the sam.e shape 
in a heap." In Experim.ent III, only Text 6 was adm.inistered to 30 _Ss who 
were directed Lo "Put all sim.ilar figures in a heap." There were no further 
comments or aids after the directions were given for each test in each 
experim.ent. After each test the results of the sorting were noted, the 
.materials were collected and the next test was offered. The results were 
classified by the frequency of occurrence. The solutions in Experiment I 
and Experim.ent II were com.pared. The chi-square test was em.ployed to test 
the hypotheses that thpre was no difference between the sorting results of 
the two groups in Experim.ents I and II respectively, for each of the six 
tests . 



4 , Find ings 

In each of the three exper im.ents , a rather large num.ber of solutions 
(10 ^ n £ 20) was found in each test. There was no solution am.ong the 
whole sam.ple which took account of the attribute of rectangles having the 
sam.e side-length ratio. Thus no one discovered the m.athem.at ical sim.ilarity 
as a principle of classifying the geom.etric figures. 

When the results of Experim.ent I and Experim.ent II were com.pared, 
it was found that: 

(a) A wide range ot different solutions resulted from each 
request . 
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(b) Varying the terms "similar" and ^'sam.e shape" in the 
manner tested did not lead to an overall .olution with 
respect to geom.etrical sim.ilarity of rectangles. However, 
in two of the subtests, Test Three (p < 0.05) and Test 
Four (p < 0.001), there were significant relations 
between the type of request (sim.ilar, sam.e shape) and 

the type of solutions. On the other four tests, the 
hypothesis that there was no difference between the 
results in the two groups could net be rejected. 

(c) There was a significant influence (p < 0.001) of 
the preceding testo on the type of solution and the 
arrangement of the tests, as was shown when Test Six 
was given in isolation. 

(d) Sex and age did not seem, to influence the solution. 

5 . In terpretat ions 

The request to put together similar figures is understood in m.any 
different ways. All sorting testr m.ight be explained by the personal 
styles of experiences. The Euclidean solution was obviously not noticed 
in these exper im.ents . Although "sim.ilar" in the Euclidean sense and "of 
the sam.e shape" indicate the sam.e geom.etrical relation, they are under- 
stood in different ways when used in sorting problem.s. "Sim.ilar" seem.s 
to tend more to attribuf^s of the figure, whereas "sam.e shape" tends m.ore 
to the "Gestalt." Teaching strategies for geometrical concepts wliich also 
have a meaning in colloquial speech should take into consideration the 
fact that the mere knowledge of the nam.e doe^ not include the full geom.et- 
rical understanding of the concept. It is necessary to standardize the 
understanding within the class by stating a definition. 

Grit ical Comjnentary 

There should be m.ore investigations of this nature, exploring how 
the names of m.at hemat ical concepts which are also used in colloquial speech 
affect the developm.ent of learning the m.athem.at ical concept. The issue 
is im.portant for the learning and teaching of geometry in elementary as 
well as secondary school m.athem.at ics . The problem has previously been 
investigated by Piaget and Minski, but the author fails to cite how his 
findings were com.pared to those previously known results. 

The report of the study is incomplete in the following respects: 

1) The m.ethod used to administer the tests is unclear: 
whether the tests were adm.inistered to large groups, 
to sm.all groups, or to individuals is unstated. 

2) The author alluded to results expected for the sorting 
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solutions from students who had not been taught 
formal geom.etry, but failed to indicate how many 
of his S_s (ages ranging from 8 years to 19 years), 
if any, had had a formal course in the subject. 

3) The results reported only that there was no re- 
lationship of age and sex related tc th^ S^s in 
the first experim.ent (N = 110, ages 8 years to 
15 years) who were requested to sort "sim.ilar" 
figures . Were sim.ilar results found relat ive to 
age and sex of the S_s in the second experim.ent 
(N = 50, 8 years to 19 years) who were requested 
to sort ''same shape'* figures? 

The concept of sim.ilarity is taught in the elementary and junior 
high school informal geometry. Em.phasis is on the "sam.e shape" with 
little attention given to the side-length ratio or to the proportional 
concepts. Maybe we should find out how elem.entary school teachers 
would respond to the ("sam.e shape", "similar") requests in these sorting 
tasks . 
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